IMPROVED ADENOVIRUS AND METHODS OF USE THEREOF

This invention was supported by the National
Institutes of Health Grant No. P30 DK 47757. The United
States government has rights in this invention.

Field of the Invention

The present invention relates to the field of 
vectors useful in somatic gene therapy and the production 
thereof.

Background of the Invention

Human gene therapy is an approach to treating human 
disease that is based on the modification of gene 
expression in cells of the patient. It has become
apparent over the last decade that the single most 
outstanding barrier to the success of gene therapy as a 
strategy for treating inherited diseases, cancer, and 
other genetic dysfunctions is the development of useful 
gene transfer vehicles. Eukaryotic viruses have been 
employed as vehicles for somatic gene therapy. Among the 
viral vectors that have been cited frequently in gene 
therapy research are adenoviruses. 

Adenoviruses are eukaryotic DNA viruses that can be 
modified to efficiently deliver a therapeutic or reporter
transgene to a variety of cell types. Recombinant
adenoviruses types 2 and 5 (Ad2 and Ad5, respectively),
which cause respiratory disease in humans, are currently
being developed for gene therapy. Both Ad2 and Ad5
belong to a subclass of adenovirus that are not
associated with human malignancies. Recombinant
adenoviruses are capable of providing extremely high
levels of transgene delivery to virtually all cell types,
regardless of the mitotic state. High titers (10^13
plaque forming units/ml) of recombinant virus can be
easily generated in 293 cells (the adenovirus equivalent
to retrovirus packaging cell lines) and cryo-stored for
extended periods without appreciable losses. The
efficacy of this system in delivering a therapeutic 
transgene in vivo that complements a genetic imbalance 
has been demonstrated in animal models of various
disorders [Y. Watanabe, Atherosclerosis, 36:261-268
(1986); K. Tanzawa et al, FEBS Letters, 114(1):81-84
(1980); J.L. Goldstein et al, New Engl. J. Med.,
309:288-296 (1983); S. Ishibashi et al, J. Clin.
Invest., 92:883-893 (1993); and S. Ishibashi et al, J.
Clin. Invest., 93:1885-1893 (1994)]. Indeed, a
recombinant replication defective adenovirus encoding a
cDNA for the cystic fibrosis transmembrane regulator
(CFTR) has been approved for use in at least two human CF
clinical trials [see, e.g., J. Wilson, Nature, 365:691-692
(Oct. 21, 1993)]. Further support of the safety of
recombinant adenoviruses for gene therapy is the 
extensive experience of live adenovirus vaccines in human 
populations. 

Human adenoviruses are comprised of a linear,
approximately 36 kb double-stranded DNA genome, which is
divided into 100 map units (m.u.), each of which is 360
bp in length. The DNA contains short inverted terminal
repeats (ITR) at each end of the genome that are required
for viral DNA replication. The gene products are
organized into early (E1 through E4) and late (L1 through
L5) regions, based on expression before or after the
initiation of viral DNA synthesis [see, e.g., Horwitz,
Virology, 2d edit., ed. B. N. Fields, Raven Press, Ltd.,
New York (1990)].
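
The map-unit convention described above reduces to a simple
proportion. The following is a minimal Python sketch, assuming
the nominal 36 kb genome length stated in the text (individual
serotype genomes differ slightly in length, so real coordinates
are only approximate). The function and constant names are
illustrative and are not part of the specification.

```python
# Nominal values taken from the text: ~36 kb genome, 100 map units.
GENOME_LENGTH_BP = 36_000   # assumption: nominal length, not an exact genome size
MAP_UNITS = 100
BP_PER_MAP_UNIT = GENOME_LENGTH_BP / MAP_UNITS  # 360 bp per map unit


def bp_to_map_units(bp: float) -> float:
    """Convert a base-pair coordinate to map units (m.u.)."""
    return bp / BP_PER_MAP_UNIT


def map_units_to_bp(mu: float) -> int:
    """Convert map units to the nearest base-pair coordinate."""
    return round(mu * BP_PER_MAP_UNIT)


if __name__ == "__main__":
    # The left-terminal ~360 bp correspond to roughly the first map unit.
    print(bp_to_map_units(360))
    print(map_units_to_bp(100))
```

Under this convention a coordinate of 360 bp falls at 1 m.u., and
100 m.u. corresponds to the full nominal genome.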

The first-generation recombinant, replication-
deficient adenoviruses which have been developed for gene
therapy contain deletions of the entire E1a and part of
the E1b regions. This replication-defective virus is
grown in an adenovirus-transformed, complementation human
embryonic kidney cell line containing a functional
adenovirus E1a gene which provides a transacting E1a
protein, the 293 cell [ATCC CRL1573]. E1-deleted viruses
are capable of replicating and producing infectious virus
in the 293 cells, which provide E1a and E1b region gene
products in trans. The resulting virus is capable of
infecting many cell types and can express the introduced
gene (providing it carries its own promoter), but cannot
replicate in a cell that does not carry the E1 region DNA
unless the cell is infected at a very high multiplicity
of infection.

However, in vivo studies revealed transgene
expression in these E1 deleted vectors was transient and
invariably associated with the development of severe
inflammation at the site of vector targeting [S.
Ishibashi et al, J. Clin. Invest., 93:1885-1893 (1994);
J. M. Wilson et al, Proc. Natl. Acad. Sci. USA, 85:4421-
4424 (1988); J. M. Wilson et al, Clin. Bio., 2:21-26
(1991); M. Grossman et al, Som. Cell. and Mol. Gen.,
17:601-607 (1991)]. One explanation that has been
proposed for this finding is that first generation
recombinant adenoviruses, despite the deletion of E1
genes, express low levels of other viral proteins. This
could be due to basal expression from the unstimulated
viral promoters or transactivation by cellular factors.
Expression of viral proteins leads to cellular immune
responses to the genetically modified cells, resulting in
their destruction and replacement with nontransgene
containing cells.

There yet remains a need in the art for the
development of additional adenovirus vector constructs
for gene therapy.


Summary of the Invention

In one aspect, the invention provides the components
of a novel recombinant adenovirus production system. One
component is a shuttle plasmid, pAdA, that comprises
adenovirus cis-elements necessary for replication and
virion encapsidation and is deleted of all viral genes.
This vector carries a selected transgene under the
control of a selected promoter and other conventional
vector/plasmid regulatory components. The other
component is a helper adenovirus, which alone or with a
packaging cell line, supplies sufficient gene sequences
necessary for a productive viral infection. In a
preferred embodiment, the helper virus has been altered
to contain modifications to the native gene sequences
which direct efficient packaging, so as to substantially
disable or "cripple" the packaging function of the helper
virus or its ability to replicate.

In another aspect, the present invention provides a
unique recombinant adenovirus, an AdA virus, produced by
use of the components above. This recombinant virus
comprises an adenovirus capsid, adenovirus cis-elements
necessary for replication and virion encapsidation, but
is deleted of all viral genes (i.e., all viral open
reading frames). This virus particle carries a selected
transgene under the control of a selected promoter and
other conventional vector regulatory components. This
AdA recombinant virus is characterized by high titer
transgene delivery to a host cell and the ability to
stably integrate the transgene into the host cell
chromosome. In one embodiment, the virus carries as its
transgene a reporter gene. Another embodiment of the
recombinant virus contains a therapeutic transgene.

In another aspect, the invention provides a method 
for producing the above-described recombinant AdA virus
by co-transfecting a cell line (either a packaging cell
line or a non-packaging cell line) with a shuttle vector
or plasmid and a helper adenovirus as described above, 
wherein the transfected cell generates the AdA virus. 
The AdA virus is subsequently isolated and purified 
therefrom. 

In yet a further aspect, the invention provides a 
method for delivering a selected gene to a host cell for 
expression in that cell by administering an effective 
amount of a recombinant AdA virus containing a 
therapeutic transgene to a patient to treat or correct a 
genetically associated disorder or disease. 

Other aspects and advantages of the present 
invention are described further in the following detailed 
description of the preferred embodiments thereof. 

Brief Description of the Figures

Fig. 1A is a schematic representation of the 
organization of the major functional elements that define 
the 5' terminus from Ad5 including an inverted terminal 
repeat (ITR) and a packaging/ enhancer domain. The TATA 
box of the El promoter (black box) and E1A 
transcriptional start site (arrow) are also shown. 

Fig. 1B is an expanded schematic of the
packaging/enhancer region of Fig. 1A, indicating the five
packaging (PAC) domains (A-repeats), I through V. The
arrows indicate the location of PCR primers referenced in
Figs. 9A and 9B below.

Fig. 2A is a schematic of shuttle vector
pAdA.CMVLacZ containing 5' ITR from Ad5, followed by a
CMV promoter/enhancer, a LacZ gene, a 3' ITR from Ad5,
and remaining plasmid sequence from plasmid pSP72
backbone. Restriction endonuclease enzymes are
represented by conventional designations in the plasmid
constructs.

Fig. 2B is a schematic of the shuttle vector
digested with EcoRI to release the modified AdA genome
from the pSP72 plasmid backbone.

Fig. 2C is a schematic depiction of the function of
the vector system. In the presence of an E1-deleted
helper virus Ad.CBhpAP which encodes a reporter minigene
for human placenta alkaline phosphatase (hpAP), the
AdA.CMVLacZ genome is packaged into preformed virion
capsids, distinguishable from the helper virions by the
presence of the LacZ gene.

Figs. 3A to 3F [SEQ ID NO: 1] report the top DNA
strand of the double-stranded plasmid pAdA.CMVLacZ. The
complementary sequence may be readily obtained by one of
skill in the art. The sequence includes the following
components: 3' Ad ITR (nucleotides 607-28 of SEQ ID NO:
1); the 5' Ad ITR (nucleotides 5496-5144 of SEQ ID NO:
1); CMV promoter/enhancer (nucleotides 5117-4524 of SEQ
ID NO: 1); SD/SA sequence (nucleotides 4507-4376 of SEQ
ID NO: 1); LacZ gene (nucleotides 4320-845 of SEQ ID NO:
1); and a poly A sequence (nucleotides 837-639 of SEQ ID
NO: 1).

Fig. 4A is a schematic of shuttle vector
pAdAc.CMVLacZ containing an Ad5 5' ITR and 3' ITR
positioned head-to-tail, with a CMV enhancer/promoter-
LacZ minigene immediately following the 5' ITR, followed
by a plasmid pSP72 (Promega) backbone. Restriction
endonuclease enzymes are represented by conventional
designations in the plasmid constructs.

Fig. 4B is a schematic depiction of the function of
the vector system of Fig. 4A. In the presence of helper
virus Ad.CBhpAP, the circular pAdAc.CMVLacZ shuttle
vector sequence is packaged into virion heads,
distinguishable from the helper virions by the presence
of the LacZ gene.


Figs. 5A to 5F [SEQ ID NO: 2] report the top DNA
strand of the double-stranded vector pAdAc.CMVLacZ. The
complementary sequence may be readily obtained by one of
skill in the art. The sequence includes the following
components: 5' Ad ITR (nucleotides 600-958 of SEQ ID NO:
2); CMV promoter/enhancer (nucleotides 969-1563 of SEQ ID
NO: 2); SD/SA sequence (nucleotides 1579-1711); LacZ gene
(nucleotides 1762-5236 of SEQ ID NO: 2); poly A sequence
(nucleotides 5245-5443 of SEQ ID NO: 2); and 3' Ad ITR
(nucleotides 16-596 of SEQ ID NO: 2).

Fig. 6 is a schematic of shuttle vector pAdA.CBCFTR 
containing 5' ITR from Ad5, followed by a chimeric CMV 
enhancer/β-actin promoter enhancer, a CFTR gene, a poly-A
sequence, a 3' ITR from Ad5, and remaining plasmid 
sequence from plasmid pSL1180 (Pharmacia) backbone. 
Restriction endonuclease enzymes are represented by 
conventional designations in the plasmid constructs. 

Figs. 7A to 7H [SEQ ID NO: 3] report the top DNA
strand of the double-stranded plasmid pAdA.CBCFTR. The
complementary sequence may be readily obtained by one of
skill in the art. The sequence includes the following
components: 5' Ad ITR (nucleotides 9611-9254 of SEQ ID
NO: 3); chimeric CMV enhancer/β-actin promoter
(nucleotides 9241-8684 of SEQ ID NO: 3); CFTR gene
(nucleotides 8622-4065 of SEQ ID NO: 3); poly A sequence
(nucleotides 3887-3684 of SEQ ID NO: 3); and 3' Ad ITR
(nucleotides 3652-3073 of SEQ ID NO: 3). The remaining
plasmid backbone is obtained from pSL1180 (Pharmacia).

Fig. 8A illustrates the generation of 5' adenovirus
terminal sequence that contained PAC domains I and II by
PCR. See, arrows indicating righthand and lefthand (PAC
II) PCR probes in Fig. 1B.

Fig. 8B illustrates the generation of 5' terminal
sequence that contained PAC domains I, II, III and IV by
PCR. See, arrows indicating righthand and lefthand (PAC
IV) PCR probes in Fig. 1B.

Fig. 8C depicts the amplification products subcloned
into the multiple cloning site of pAd.Link.1 (IHGT Vector
Core) generating pAd.PACII (domains I and II) and
pAd.PACIV (domains I, II, III, and IV) resulting in
crippled helper viruses, Ad.PACII and Ad.PACIV with
modified packaging (PAC) signals.

Fig. 9A is a schematic representation of the
subcloning of a human placenta alkaline phosphatase
reporter minigene containing the immediate early CMV
enhancer/promoter (CMV), human placenta alkaline
phosphatase cDNA (hpAP), and SV40 polyadenylation signal
(PA) into pAd.PACII to result in crippled helper virus
vector pAdA.PACII.CMVhpAP. Restriction endonuclease
enzymes are represented by conventional designations in
the plasmid constructs.

Fig. 9B is a schematic representation of the
subcloning of the same minigene of Fig. 9A into pAd.PACIV
to result in crippled helper virus vector
pAd.PACIV.CMV.hpAP.

Fig. 10 is a flow diagram summarizing the synthesis
of an adenovirus-based polycation helper virus conjugate
and its combination with a pAdA shuttle vector to result
in a novel viral particle complex. CsCl band purified
helper adenovirus was reacted with the heterobifunctional
crosslinker sulfo-SMCC and the capsid protein fiber is
labeled with the nucleophilic maleimide moiety. Free
sulfhydryls were introduced onto poly-L-lysine using 2-
iminothiolane-HCl and mixed with the labelled adenovirus,
resulting in the helper virus conjugate Ad-pLys. A
unique adenovirus-based particle is generated by
purifying the Ad-pLys conjugate over a CsCl gradient to
remove unincorporated poly-L-lysine, followed by
extensively dialyzing, adding shuttle plasmid DNAs to Ad-
pLys and allowing the complex formed by the shuttle
plasmid wrapped around Ad-pLys to develop.

Fig. 11 is a schematic diagram of pCCL-DMD, which is 
described in detail in Example 9 below. 

Figs. 12A - 12P provide the continuous DNA sequence
of pAdA.CMVmDys [SEQ ID NO: 10].

Detailed Description of the Invention

The present invention provides a unique recombinant 
adenovirus capable of delivering transgenes to target 
cells, as well as the components for production of the 
unique virus and methods for the use of the virus to 
treat a variety of genetic disorders. 

The AdA virus of this invention is a viral particle
containing only the adenovirus cis-elements necessary for
replication and virion encapsidation (i.e., ITRs and
packaging sequences), but otherwise deleted of all
adenovirus genes (i.e., all viral open reading frames).
This virus carries a selected transgene under the control
of a selected promoter and other conventional regulatory
components, such as a poly A signal. The AdA virus is
characterized by improved persistence of the vector DNA
in the host cells, reduced antigenicity/immunogenicity,
and hence, improved performance as a delivery vehicle.
An additional advantage of this invention is that the AdA
virus permits the packaging of very large transgenes,
such as a full-length dystrophin cDNA for the treatment
of the progressive wasting of muscle tissue
characteristic of Duchenne Muscular Dystrophy (DMD).

This novel recombinant virus is produced by use of
an adenovirus-based vector production system containing
two components: 1) a shuttle vector that comprises
adenovirus cis-elements necessary for replication and
virion encapsidation and is deleted of all viral genes,
which vector carries a reporter or therapeutic minigene
and 2) a helper adenovirus which, alone or with a
packaging cell line, is capable of providing all of the
viral gene products necessary for a productive viral
infection when co-transfected with the shuttle vector.
Preferably, the helper virus is modified so that it does
not package itself efficiently. In this setting, it is
desirably used in combination with a packaging cell line
that stably expresses adenovirus genes. The methods of
producing this viral vector from these components include
both a novel means of packaging of an
adenoviral/transgene containing vector into a virus, and
a novel method for the subsequent separation of the
helper virus from the newly formed recombinant virus.

I. The Shuttle Vector

The shuttle vector, referred to as pAdA, is composed
of adenovirus sequences, and transgene sequences,
including vector regulatory control sequences.

A. The Adenovirus Sequences

The adenovirus nucleic acid sequences of the 
shuttle vector provide the minimum adenovirus sequences 
which enable a viral particle to be produced with the 
assistance of a helper virus. These sequences assist in
delivery of a recombinant transgene genome to a target 
cell by the resulting recombinant virus. 

The DNA sequences of a number of adenovirus
types are available from Genbank, including type Ad5
[Genbank Accession No. M73260]. The adenovirus sequences
may be obtained from any known adenovirus serotype, such
as serotypes 2, 3, 4, 7, 12 and 40, and further including
any of the presently identified 41 human types [see,
e.g., Horwitz, cited above]. Similarly adenoviruses
known to infect other animals may also be employed in the
vector constructs of this invention. The selection of
the adenovirus type is not anticipated to limit the
following invention. A variety of adenovirus strains are
available from the American Type Culture Collection,
Rockville, Maryland, or available by request from a
variety of commercial and institutional sources. In the
following exemplary embodiment an adenovirus, type 5
(Ad5) is used for convenience.

However, it is desirable to obtain a variety of
pAdA shuttle vectors based on different human adenovirus
serotypes. It is anticipated that a library of such
plasmids and the resulting AdA viral vectors would be
useful in a therapeutic regimen to evade cellular, and
possibly humoral, immunity, and lengthen the duration of
transgene expression, as well as improve the success of
repeat therapeutic treatments. Additionally the use of
various serotypes is believed to produce recombinant
viruses with different tissue targeting specificities.
The absence of adenoviral genes in the AdA viral vector
is anticipated to reduce or eliminate adverse CTL
response which normally causes destruction of recombinant
adenoviruses deleted of only the E1 gene.

Specifically, the adenovirus nucleic acid
sequences employed in the pAdA shuttle vector of this
invention are adenovirus genomic sequences from which all
viral genes are deleted. More specifically, the
adenovirus sequences employed are the cis-acting 5' and
3' inverted terminal repeat (ITR) sequences of an
adenovirus (which function as origins of replication) and
the native 5' packaging/enhancer domain, that contains
sequences necessary for packaging linear Ad genomes and
enhancer elements for the E1 promoter. These sequences
are the sequences necessary for replication and virion
encapsidation. See, e.g., P. Hearing et al, J. Virol.,
61(8):2555-2558 (1987); M. Grable and P. Hearing,
J. Virol., 64(5):2047-2056 (1990); and M. Grable and P.
Hearing, J. Virol., 66(2):723-731 (1992).

According to this invention, the entire
adenovirus 5' sequence containing the 5' ITR and
packaging/enhancer region can be employed as the 5'
adenovirus sequence in the pAdA shuttle vector. This
left terminal (5') sequence of the Ad5 genome useful in
this invention spans bp 1 to about 360 of the
conventional adenovirus genome, also referred to as map
units 0-1 of the viral genome. This sequence is provided
herein as nucleotides 5496-5144 of SEQ ID NO: 1,
nucleotides 600-958 of SEQ ID NO: 2; and nucleotides
9611-9254 of SEQ ID NO: 3, and generally is from about
353 to about 360 nucleotides in length. This sequence
includes the 5' ITR (bp 1-103 of the adenovirus genome),
and the packaging/enhancer domain (bp 194-358 of the
adenovirus genome). See, Figs. 1A, 3, 5, and 7.

Preferably, this native adenovirus 5' region is
employed in the shuttle vector in unmodified form.
However, some modifications including deletions,
substitutions and additions to this sequence which do not
adversely affect its biological function may be
acceptable. See, e.g., WO 93/24641, published December
9, 1993. The ability to modify these ITR sequences is
within the ability of one of skill in the art. See,
e.g., texts such as Sambrook et al, "Molecular Cloning.
A Laboratory Manual.", 2d edit., Cold Spring Harbor
Laboratory, Cold Spring Harbor, New York (1989).

The 3' adenovirus sequences of the shuttle
vector include the right terminal (3') ITR sequence of
the adenoviral genome spanning about bp 35,353 - end of
the adenovirus genome, or map units 98.4-100. This
sequence is provided herein as nucleotides 607-28 of SEQ
ID NO: 1, nucleotides 16-596 of SEQ ID NO: 2; and
nucleotides 3652-3073 of SEQ ID NO: 3, and generally is
about 580 nucleotides in length. This entire sequence is
desirably employed as the 3' sequence of a pAdA shuttle
vector. Preferably, the native adenovirus 3' region is
employed in the shuttle vector in unmodified form.
However, some modifications to this sequence which do not
adversely affect its biological function may be
acceptable.
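
The stated lengths of the terminal sequences can be checked
directly against the nucleotide ranges given for SEQ ID NOS: 1-3.
The following Python sketch is illustrative only: the dictionary
layout and function name are assumptions introduced here, and the
coordinates are simply those quoted in the text. A range written
high-to-low (e.g., 5496-5144) is read on the listed strand in
reverse, so its length is computed from the absolute difference.

```python
# Coordinates as quoted in the text for each sequence listing.
TERMINAL_COORDS = {
    "5' sequence": {
        "SEQ ID NO: 1": (5496, 5144),
        "SEQ ID NO: 2": (600, 958),
        "SEQ ID NO: 3": (9611, 9254),
    },
    "3' sequence": {
        "SEQ ID NO: 1": (607, 28),
        "SEQ ID NO: 2": (16, 596),
        "SEQ ID NO: 3": (3652, 3073),
    },
}


def span_length(start: int, end: int) -> int:
    """Inclusive length of a nucleotide range, in either orientation."""
    return abs(start - end) + 1


def terminal_lengths() -> dict:
    """Return {region: {sequence listing: length in nucleotides}}."""
    return {
        region: {seq: span_length(*coords) for seq, coords in listings.items()}
        for region, listings in TERMINAL_COORDS.items()
    }


if __name__ == "__main__":
    for region, lengths in terminal_lengths().items():
        print(region, lengths)
```

The computed 5' lengths fall between 353 and 359 nucleotides and
the 3' lengths at 580 or 581 nucleotides, consistent with the
approximate figures given above.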

An exemplary pAdA shuttle vector of this 
invention, described below and in Fig. 2A, contains only 
those adenovirus sequences required for packaging 
adenoviral genomic DNA into a preformed capsid head. The 
pAdA vector contains Ad5 sequences encoding the 5' 
terminal and 3' terminal sequences (identified in the 
description of Fig. 3), as well as the transgene 
sequences described below. 

From the foregoing information, it is expected 
that one of skill in the art may employ other equivalent 
adenovirus sequences for use in the AdA vectors of this 
invention. These sequences may include other adenovirus 
strains, or the above mentioned cis-acting sequences with 
minor modifications. 

B. The Transgene

The transgene sequence of the vector and 
recombinant virus is a nucleic acid sequence or reverse 
transcript thereof, heterologous to the adenovirus 
sequence, which encodes a polypeptide or protein of 
interest. The transgene is operatively linked to 
regulatory components in a manner which permits transgene 
transcription. 

The composition of the transgene sequence will
depend upon the use to which the resulting virus will be
put. For example, one type of transgene sequence
includes a reporter sequence, which upon expression
produces a detectable signal. Such reporter sequences
include without limitation an E. coli beta-galactosidase
(LacZ) cDNA, a human placental alkaline phosphatase gene
and a green fluorescent protein gene. These sequences,
when associated with regulatory elements which drive
their expression, provide signals detectable by
conventional means, e.g., ultraviolet wavelength
absorbance, visible color change, etc.

Another type of transgene sequence includes a
therapeutic gene which expresses a desired gene product
in a host cell. These therapeutic nucleic acid sequences
typically encode products for administration and
expression in a patient in vivo or ex vivo to replace or
correct an inherited or non-inherited genetic defect or
treat an epigenetic disorder or disease. Such
therapeutic genes which are desirable for the performance
of gene therapy include, without limitation, a normal
cystic fibrosis transmembrane regulator (CFTR) gene (see
Fig. 7), a low density lipoprotein (LDL) gene [T.
Yamamoto et al, Cell, 39:27-28 (November, 1984)], a DMD
cDNA sequence [partial sequences available from GenBank,
Accession Nos. M36673, M36671, [A. P. Monaco et al,
Nature, 323:646-650 (1986)] and L06900, [Roberts et al,
Hum. Mutat., 2:293-299 (1993)]] (Genbank), and a number
of genes which may be readily selected by one of skill in
the art. The selection of the transgene is not
considered to be a limitation of this invention, as such
selection is within the knowledge of the art-skilled.

C. Regulatory Elements

In addition to the major elements identified
above for the pAdA shuttle vector, i.e., the adenovirus
sequences and the transgene, the vector also includes
conventional regulatory elements necessary to drive
expression of the transgene in a cell transfected with
the pAdA vector. Thus the vector contains a selected
promoter which is linked to the transgene and located,
with the transgene, between the adenovirus sequences of
the vector.

Selection of the promoter is a routine matter
and is not a limitation of the pAdA vector itself.
Useful promoters may be constitutive promoters or
regulated (inducible) promoters, which will enable
control of the amount of the transgene to be expressed.
For example, a desirable promoter is that of the
cytomegalovirus immediate early promoter/enhancer [see,
e.g., Boshart et al, Cell, 41:521-530 (1985)]. This
promoter is found at nucleotides 5117-4524 of SEQ ID NO:
1 and nucleotides 969-1563 of SEQ ID NO: 2. Another
promoter is the CMV enhancer/chicken β-actin promoter
(nucleotides 9241-8684 of SEQ ID NO: 3). Another
desirable promoter includes, without limitation, the Rous
sarcoma virus LTR promoter/enhancer. Still other
promoter/enhancer sequences may be selected by one of
skill in the art.

The shuttle vectors will also desirably contain
nucleic acid sequences heterologous to the adenovirus
sequences including sequences providing signals required
for efficient polyadenylation of the transcript and
introns with functional splice donor and acceptor sites
(SD/SA). A common poly-A sequence which is employed in
the exemplary vectors of this invention is that derived
from the papovavirus SV-40 [see, e.g., nucleotides 837-
639 of SEQ ID NO: 1; 5245-5443 of SEQ ID NO: 2; and 3887-
3684 of SEQ ID NO: 3]. The poly-A sequence generally is
inserted in the vector following the transgene sequences
and before the 3' adenovirus sequences. A common intron
sequence is also derived from SV-40, and is referred to
as the SV-40 T intron sequence [see, e.g., nucleotides
4507-4376 of SEQ ID NO: 1 and 1579-1711 of SEQ ID NO: 2].
A pAdA shuttle vector of the present invention may also
contain such an intron, desirably located between the
promoter/enhancer sequence and the transgene. Selection
of these and other common vector elements is
conventional and many such sequences are available [see,
e.g., Sambrook et al, and references cited therein].
Examples of such regulatory sequences for the above are
provided in the plasmid sequences of Figs. 3, 5 and 7.

The combination of the transgene, promoter/
enhancer, and the other regulatory vector elements is
referred to as a "minigene" for ease of reference herein.
The minigene is preferably flanked by the 5' and 3' cis-
acting adenovirus sequences described above. Such a
minigene may have a size in the range of several hundred
base pairs up to about 30 kb due to the absence of
adenovirus early and late gene sequences in the vector.
Thus, this AdA vector system permits a great deal of
latitude in the selection of the various components of
the minigene, particularly the selected transgene, with
regard to size. Provided with the teachings of this
invention, the design of such a minigene can be made by
resort to conventional techniques.
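
As a rough illustration of the size latitude described above, the
span of the exemplary CMV-LacZ minigene can be computed from the
coordinates quoted for SEQ ID NO: 2 and compared with the stated
upper bound of about 30 kb. This Python sketch is a hedged example:
the names are hypothetical, the 30,000 bp figure is the approximate
limit given in the text rather than a measured capacity, and the
span is taken from the first promoter nucleotide to the last poly-A
nucleotide.

```python
# Element coordinates quoted in the text for SEQ ID NO: 2 (pAdAc.CMVLacZ).
MINIGENE_ELEMENTS = {
    "CMV promoter/enhancer": (969, 1563),
    "SD/SA intron": (1579, 1711),
    "LacZ gene": (1762, 5236),
    "poly A": (5245, 5443),
}

MAX_MINIGENE_BP = 30_000  # assumption: "about 30 kb" treated as a hard limit


def element_length(start: int, end: int) -> int:
    """Inclusive length of one element."""
    return abs(end - start) + 1


def minigene_span(elements: dict) -> int:
    """Length from the lowest to the highest coordinate across all elements."""
    positions = [p for coords in elements.values() for p in coords]
    return max(positions) - min(positions) + 1


def fits_in_vector(span_bp: int, limit_bp: int = MAX_MINIGENE_BP) -> bool:
    """True when the minigene span does not exceed the approximate limit."""
    return span_bp <= limit_bp


if __name__ == "__main__":
    span = minigene_span(MINIGENE_ELEMENTS)
    print(span, fits_in_vector(span))
```

By this calculation the exemplary minigene spans 4,475 bp, well
within the approximate upper bound, which leaves substantial room
for larger transgenes.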

II. The Helper Virus

Because of the limited amount of adenovirus sequence
present in the AdA shuttle vector, a helper adenovirus of
this invention must, alone or in concert with a packaging
cell line, provide sufficient adenovirus gene sequences
necessary for a productive viral infection. Helper
viruses useful in this invention thus contain selected
adenovirus gene sequences, and optionally a second
reporter minigene.

Normally, the production of a recombinant adenovirus
which utilizes helper adenovirus containing a full
complement of adenoviral genes results in recombinant
virus contaminated by excess production of the helper
virus. Thus, extensive purification of the viral vector
from the contaminating helper virus is required.
However, the present invention provides a way to 
facilitate purification and reduce contamination by 
crippling the helper virus. 

One preferred embodiment of a helper virus of this 
invention thus contains three components: (A)
modifications or deletions of the native adenoviral gene
sequences which direct efficient packaging, so as to
substantially disable or "cripple" the packaging function
of the helper virus or its ability to replicate, (B)
selected adenovirus genes and (C) an optional reporter
minigene. These "crippled" helper viruses may also be
formed into poly-cation conjugates as described below. 

The adenovirus sequences forming the helper virus 
may be obtained from the sources identified above in the 
discussion of the shuttle vector. Use of different Ad 
serotypes as helper viruses enables production of 
recombinant viruses containing the AdA (serotype 5)
shuttle vector sequences in a capsid formed by the other
serotype adenovirus. These recombinant viruses are
desirable in targeting different tissues, or evading an
immune response to the AdA sequences having a serotype 5
capsid. Use of these different Ad serotype helper 
viruses may also demonstrate advantages in recombinant 
virus production, stability and better packaging. 

A. The Crippling Modifications

A desirable helper virus used in the production
of the adenovirus vector of this invention is modified
(or crippled) in its 5' ITR packaging/enhancer domain,
identified above. As stated above, the
packaging/enhancer region contains sequences necessary
for packaging linear adenovirus genomes ("PAC"
sequences). More specifically, this sequence contains at
least seven distinct yet functionally redundant domains
that are required for efficient encapsidation of
replicated viral DNA.

Within a stretch of nucleotide sequence from bp
194-358 of the Ad5 genome, five of these so-called A-
repeats or PAC sequences are localized (see, Fig. 1B).
PAC I is located at bp 241-248 of the adenovirus genome
(on the strand complementary to nucleotides 5259-5246 of
SEQ ID NO: 1). PAC II is located at bp 262-269 of the
adenovirus genome (on the strand complementary to
nucleotides 5238-5225 of SEQ ID NO: 1). PAC III is
located at bp 304-311 of the adenovirus genome (on the
strand complementary to nucleotides 5196-5183 of SEQ ID
NO: 1). PAC IV is located at bp 314-321 of the
adenovirus (on the strand complementary to nucleotides
5186-5172 of SEQ ID NO: 1). PAC V is located at bp 339-
346 of the adenovirus (on the strand complementary to
nucleotides 5171-5147 of SEQ ID NO: 1).

Corresponding sequences can be obtained from
SEQ ID NO: 2 and 3. PAC I is located at nucleotides
837-851 of SEQ ID NO: 2; and on the strand complementary
to nucleotides 9374-9360 of SEQ ID NO: 3. PAC II is
located at nucleotides 859-863 of SEQ ID NO: 2; and on
the strand complementary to nucleotides 9353-9340 of SEQ
ID NO: 3. PAC III is located at nucleotides 901-916 of
SEQ ID NO: 2; and on the strand complementary to
nucleotides 9311-9298 of SEQ ID NO: 3. PAC IV is located
at nucleotides 911-924 of SEQ ID NO: 2; and on the strand
complementary to nucleotides 9301-9288 of SEQ ID NO: 3.
PAC V is located at nucleotides 936-949 of SEQ ID NO: 2;
and on the strand complementary to nucleotides 9276-9263
of SEQ ID NO: 3.

Table 1 below lists these five native Ad5
sequences and a consensus PAC sequence based on the
similarities between an eight-nucleotide stretch within
the five sequences. The consensus sequence contains two
positions at which the nucleotide may be A or T (A/T).
The conventional single letter designations are used for
the nucleotides, as is known to the art.
                         Table 1

A-Repeat    Adenovirus Genome Base Pair Nos.
            & Nucleotide Sequence              SEQ ID NO

I               241  248
            TAG TAAATTTG GGC                   [SEQ ID NO: 4]

II              262  269
            AGT AAGATTTG GCC                   [SEQ ID NO: 5]

III             304  311
            AGT GAAATCTG AAT                   [SEQ ID NO: 6]

IV              314  321
            GAA TAATTTTG TGT                   [SEQ ID NO: 7]

V               339  346
            CGT AATATTTG TCT                   [SEQ ID NO: 8]

Consensus   5' (A/T)AN(A/T)TTTG 3'             [SEQ ID NO: 9]
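
As an illustration only, the consensus 5' (A/T)AN(A/T)TTTG 3'
can be written as a regular expression and scanned against the
five A-repeat sequences. The sketch below is not part of the
disclosed method; the names PAC_REPEATS, CONSENSUS,
find_consensus and scan_repeats are hypothetical, and the
sequences are simply copied as they are transcribed in Table 1.

```python
import re

# A-repeat sequences as transcribed in Table 1 (flanking bases included).
# These strings are copied from the table; they are not independently verified.
PAC_REPEATS = {
    "I": "TAGTAAATTTGGGC",
    "II": "AGTAAGATTTGGCC",
    "III": "AGTGAAATCTGAAT",
    "IV": "GAATAATTTTGTGT",
    "V": "CGTAATATTTGTCT",
}

# Consensus 5' (A/T)AN(A/T)TTTG 3', where N is any nucleotide.
CONSENSUS = re.compile(r"[AT]A[ACGT][AT]TTTG")


def find_consensus(seq):
    """Return the first stretch of seq that matches the consensus, or None."""
    match = CONSENSUS.search(seq)
    return match.group(0) if match else None


def scan_repeats(repeats=PAC_REPEATS):
    """Map each A-repeat name to its matching core sequence (or None)."""
    return {name: find_consensus(seq) for name, seq in repeats.items()}


if __name__ == "__main__":
    for name, core in scan_repeats().items():
        print(name, core)
```

With the sequences exactly as transcribed here, repeats I, II,
IV and V contain a strict match to the pattern, while repeat
III does not. The sketch does not establish whether that
reflects natural divergence from the consensus or a
transcription error.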


According to this invention, mutations or 
deletions may be made to one or more of these PAC 
sequences to generate desirable crippled helper viruses. 
A deletion analysis of the packaging domain revealed a 
positive correlation between encapsidation efficiency and 
the number of packaging A-repeats that were present at 
the 5' end of the genome. Modifications of this domain
may include 5' adenovirus sequences which contain fewer
than all five of the PAC sequences of Table 1. For
example, only two PAC sequences may be present in the
crippled virus, e.g., PAC I and PAC II, PAC III and PAC
IV, and so on. Deletions of selected PAC sequences may
involve deletion of contiguous or non-contiguous
sequences. For example, PAC II and PAC IV may be
deleted, leaving PAC I, III and V in the 5' sequence.
Still another alternative modification may be the
replacement of one or more of the native PAC sequences
with one or more repeats of the consensus sequence of
Table 1. Alternatively, this adenovirus region may be
modified by deliberately inserted mutations which disrupt
one or more of the native PAC sequences. One of skill in
the art may further manipulate the PAC sequences to
similarly achieve the effect of reducing the helper virus
packaging efficiency to a desired level.

Exemplary helper viruses which involve the
manipulation of the PAC sequences described above are
disclosed in Example 7 below. Briefly, as described in
that example, one helper virus contains in place of the
native 5' ITR region (adenovirus genome bp 1-360), a 5'
adenovirus sequence spanning adenovirus genome bp 1-269,
which contains only the 5' ITR and PAC I and PAC II
sequences, and deletes the adenovirus region bp 270-360.

Another PAC sequence modified helper virus
contains only the 5' Ad5 sequence of the ITR and PAC I
through PAC IV (Ad bp 1-321), deleting PAC V and other
sequences in the Ad region bp 322-360.
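
The relationship between a truncated 5' sequence and the PAC
sequences it retains can be sketched as a simple interval
check. This is a minimal illustration, assuming the base pair
coordinates listed for PAC I through PAC V above; the names
PAC_COORDS and retained_pacs are hypothetical.

```python
# Adenovirus genome base pair coordinates of the five A-repeats,
# taken from the coordinates given in the text.
PAC_COORDS = {
    "PAC I": (241, 248),
    "PAC II": (262, 269),
    "PAC III": (304, 311),
    "PAC IV": (314, 321),
    "PAC V": (339, 346),
}


def retained_pacs(fragment_end, fragment_start=1):
    """List the PAC sequences lying entirely inside bp fragment_start..fragment_end."""
    return [
        name
        for name, (start, end) in PAC_COORDS.items()
        if fragment_start <= start and end <= fragment_end
    ]


if __name__ == "__main__":
    for end_bp in (269, 321, 360):
        print(f"bp 1-{end_bp}:", ", ".join(retained_pacs(end_bp)))
```

Under these coordinates, a 5' sequence ending at bp 269 keeps
PAC I and PAC II, one ending at bp 321 keeps PAC I through
PAC IV, and the native bp 1-360 region keeps all five.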

These modified helper viruses are characterized
by reduced efficiency of helper virus encapsidation.
These helper viruses, with the specific modifications of
the sequences related to packaging efficiency, provide a
packaging efficiency high enough for generating
production lots of the helper virus, yet low enough that
they permit the achievement of higher yields of AdA
transducing viral particles according to this invention.

B. The Selected Adenovirus Genes

Helper viruses useful in this invention,
whether or not they contain the "crippling" modifications
described above, contain selected adenovirus gene
sequences depending upon the cell line which is
transfected by the helper virus and shuttle vector. A
preferred helper virus contains a variety of adenovirus
genes in addition to the modified sequences described
above.

As one example, if the cell line employed to 
produce the recombinant virus is not a packaging cell 
line, the helper virus may be a wild type Ad virus. 
Thus, the helper virus supplies the necessary adenovirus 
early genes E1, E2, E4 and all remaining late,
intermediate, structural and non-structural genes of the
adenovirus genome. This helper virus may be a crippled
helper virus by incorporating modifications in its native
5' packaging/enhancer domain.

A desirable helper virus is replication
defective and lacks all or a sufficient portion of the
adenoviral immediate early gene E1a (which spans mu
1.3 to 4.5) and delayed early gene E1b (which spans mu
4.6 to 11.2) so as to eliminate their normal biological
functions. Such replication deficient viruses may also
have crippling modifications in the packaging/enhancer
domain. Because of the difficulty surrounding the
absolute removal of adenovirus from AdA preparations that
have been enriched by CsCl buoyant density
centrifugation, the use of a replication defective
adenovirus helper prevents the introduction of infectious
adenovirus for in vivo animal studies. This helper virus
is employed with a packaging cell line which supplies the
deficient E1 proteins, such as the 293 cell line.


Additionally, all or a portion of the
adenovirus delayed early gene E3 (which spans mu 76.6 to
86.2) may be eliminated from the adenovirus sequence
which forms a part of the helper viruses useful in this
invention, without adversely affecting the function of
the helper virus because this gene product is not
necessary for the formation of a functioning virus.

In the presence of other packaging cell lines
which are capable of supplying adenoviral proteins in
addition to the E1, the helper virus may accordingly be
deleted of the genes encoding these adenoviral proteins.
Such additionally deleted helper viruses also desirably
contain crippling modifications as described above.

C. A Reporter Minigene

It is also desirable for the helper virus to
contain a reporter minigene, in which the reporter gene
is desirably different from the reporter transgene
contained in the shuttle vector. A number of such
reporter genes are known, as referred to above. The
presence of a reporter gene on the helper virus which is
different from the reporter gene on the pAdA allows both
the recombinant AdA virus and the helper virus to be
independently monitored. For example, the expression of
recombinant alkaline phosphatase enables residual
quantities of contaminating adenovirus to be monitored
independent of recombinant LacZ expressed by a pAdA
shuttle vector or an AdA virus.

D. Helper Virus Polycation Conjugates

Still another method for reducing the
contamination of helper virus involves the formation of
poly-cation helper virus conjugates, which may be
associated with a plasmid containing other adenoviral
genes, which are not present in the helper virus. The
helper viruses described above may be further modified by
resort to adenovirus-polylysine conjugate technology.
See, e.g., Wu et al, J. Biol. Chem., 264:16985-16987
(1989); and K. J. Fisher and J. M. Wilson, Biochem. J.,
299:49 (April 1, 1994), incorporated herein by
reference.

Using this technology, a helper virus 
containing preferably the late adenoviral genes is 
modified by the addition of a poly-cation sequence 
distributed around the capsid of the helper virus. 
Preferably, the poly-cation is poly-lysine, which 
attaches around the negatively-charged vector to form an 
external positive charge. A plasmid is then designed to 
express those adenoviral genes not present in the helper 
virus, e.g., the El, E2 and/or E4 genes. The plasmid 
associates to the helper virus-conjugate through the 
charges on the poly-lysine sequence. This modification 
is also desirably made to a crippled helper virus of this 
invention. This conjugate (also termed a trans-infection 
particle) permits additional adenovirus genes to be 
removed from the helper virus and be present on a plasmid 
which does not become incorporated into the virus during 
production of the recombinant viral vector. Thus, the 
impact of contamination is considerably lessened. 

III. Assembly of Shuttle Vector, Helper Virus and
Production of Recombinant Virus

The material from which the sequences used in the 
pAdA shuttle vector and the helper viruses are derived, 
as well as the various vector components and sequences 
employed in the construction of the shuttle vectors, 
helper viruses, and AdA viruses of this invention, are
obtained from commercial or academic sources based on
previously published and described materials. These
materials may also be obtained from an individual patient
or generated and selected using standard recombinant
molecular cloning techniques known and practiced by those
skilled in the art. Any modification of existing nucleic
acid sequences forming the vectors and viruses, including
sequence deletions, insertions, and other mutations are
also generated using standard techniques.

Assembly of the selected DNA sequences of the
adenovirus, and the reporter genes or therapeutic genes
and other vector elements into the pAdA shuttle vector
using conventional techniques is described in Example 1
below. Such techniques include conventional cloning
techniques of cDNA such as those described in texts
[Sambrook et al, cited above], use of overlapping
oligonucleotide sequences of the adenovirus genomes,
polymerase chain reaction, and any suitable method which
provides the desired nucleotide sequence. Standard
transfection and co-transfection techniques are employed,
e.g., CaPO4 transfection techniques using the HEK 293
cell line. Other conventional methods employed in this
invention include homologous recombination of the viral
genomes, plaquing of viruses in agar overlay, methods of
measuring signal generation, and the like. Assembly of
any desired AdA vector or helper virus of this invention
is within the skill of the art, based on the teachings of
this invention.

A. Shuttle Vector

As described in detail in Example 1 below and
with resort to Fig. 2A and the DNA sequence of the
plasmid reported in Fig. 3, a unique pAdA shuttle vector
of this invention, pAdA.CMVLacZ, is generated.
pAdA.CMVLacZ contains Ad5 sequences encoding the 5'
terminal followed by a CMV promoter/enhancer, a splice
donor/splice acceptor sequence, a bacterial beta-
galactosidase gene (LacZ), an SV-40 poly A sequence (pA),
a 3' ITR from Ad5 and remaining plasmid sequence from
plasmid pSP72 (Promega) backbone.

To generate the AdA genome which is
incorporated in the vector, the plasmid pAdA.CMVLacZ must
be digested with EcoRI to release the AdA.CMVLacZ
genome, freeing the adenovirus ITRs and making them
available targets for replication. Thus production of
the vector is "restriction-dependent", i.e., requires
restriction endonuclease rescue of the replication
template. See Fig. 2B.
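
The idea of restriction-dependent rescue, in which EcoRI sites
flanking the insert release a linear fragment from a circular
plasmid, can be sketched on a toy sequence. The sketch below
is illustrative only: the 30-nucleotide "plasmid" is invented
and is not the pAdA.CMVLacZ sequence, and the names
ecori_cut_positions and digest_circular are hypothetical. The
only biological fact relied on is that EcoRI recognizes GAATTC
and cuts after the G on each strand; only the top strand is
modelled.

```python
ECORI_SITE = "GAATTC"
CUT_OFFSET = 1  # EcoRI cuts between G and AATTC (G^AATTC)


def ecori_cut_positions(circular_seq):
    """Return sorted top-strand cut positions in a circular sequence."""
    n = len(circular_seq)
    # Append the first few bases so sites spanning the origin are found.
    wrapped = circular_seq + circular_seq[: len(ECORI_SITE) - 1]
    cuts = [
        (i + CUT_OFFSET) % n
        for i in range(n)
        if wrapped[i : i + len(ECORI_SITE)] == ECORI_SITE
    ]
    return sorted(cuts)


def digest_circular(circular_seq):
    """Return the linear fragments produced by complete EcoRI digestion."""
    cuts = ecori_cut_positions(circular_seq)
    if not cuts:
        return [circular_seq]  # no site: the circle stays uncut
    n = len(circular_seq)
    doubled = circular_seq + circular_seq
    # Each fragment runs from one cut to the next, wrapping past the origin.
    ends = cuts[1:] + [cuts[0] + n]
    return [doubled[a:b] for a, b in zip(cuts, ends)]


if __name__ == "__main__":
    # Invented toy plasmid: backbone (C/T) with an insert (A) between two sites.
    toy_plasmid = "CCCC" + "GAATTC" + "AAAAAAAAAA" + "GAATTC" + "TTTT"
    for fragment in digest_circular(toy_plasmid):
        print(len(fragment), fragment)
```

In this toy case the two sites yield one fragment carrying the
insert and one carrying the backbone, which mirrors, in a much
simplified way, the release of the AdA.CMVLacZ genome from the
plasmid backbone.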

A second type of pAdA plasmid was designed
which places the 3' Ad terminal sequence in a head-to-
tail arrangement relative to the 5' terminal sequence.
As described in Example 1 and Fig. 4A, and with resort
to the DNA sequence of the plasmid reported in Fig. 5, a
second unique AdA vector sequence of this invention,
AdAc.CMVLacZ, is generated from the shuttle plasmid
pAdAc.CMVLacZ, which contains an Ad5 5' ITR sequence and
3' ITR sequence positioned head-to-tail, followed by a
CMV enhancer/promoter, SD/SA sequence, LacZ gene and pA
sequence in a plasmid pSP72 (Promega) backbone. As
described in Example 1B, this "restriction-independent"
plasmid permits the AdA genome to be replicated and
rescued from the plasmid backbone without including an
endonuclease treatment (see Fig. 4B).

B. Helper Virus

As described in detail in Example 2, an
exemplary conventional E1 deleted adenovirus helper virus
is virus Ad.CBhpAP, which contains a 5' adenovirus
sequence from mu 0-1, a reporter minigene containing
human placenta alkaline phosphatase (hpAP) under the
transcriptional control of the chicken β-actin promoter,
followed by a poly-A sequence from SV40, followed by
adenovirus sequences from 9.2 to 78.4 and 86 to 100.
This helper contained deletions from mu 1.0 to 9.2 and
78.4 to 86, which eliminate substantially the E1 region
and the E3 region of the virus. This virus may be
desirably crippled according to this invention by
modifications to its packaging enhancer domain.

Exemplary crippled helper viruses of this
invention are described using the techniques described in
Example 7 and contain the modified 5' PAC sequences,
i.e., adenovirus genome bp 1-269; m.u. 0-0.75 or
adenovirus genome bp 1-321; m.u. 0-0.89. Briefly, the 5'
sequences are modified by PCR and cloned by conventional
techniques into a conventional adenovirus based plasmid.
A hpAP minigene is incorporated into the plasmid, which
is then altered by homologous recombination with an E3
deleted adenovirus dl7001 to result in the modified
vectors so that the reporter minigene is followed on its
3' end with the adenovirus sequences mu 9.6 to 78.3 and
87 to 100.
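
The pairing of base pair coordinates with map units above
follows from the definition of a map unit as one percent of
the genome length. As a rough sketch, assuming a round genome
length of 36,000 bp (an approximation chosen here because it
reproduces the paired values in the text, not a figure stated
in it), the conversion can be written as follows. The names
ASSUMED_GENOME_BP, bp_to_map_units and map_units_to_bp are
hypothetical.

```python
# Assumption: a round 36,000 bp genome, so one map unit is about 360 bp.
ASSUMED_GENOME_BP = 36_000


def bp_to_map_units(bp, genome_bp=ASSUMED_GENOME_BP):
    """Convert a base pair coordinate to map units (percent of genome length)."""
    return 100.0 * bp / genome_bp


def map_units_to_bp(mu, genome_bp=ASSUMED_GENOME_BP):
    """Convert map units back to an approximate base pair coordinate."""
    return round(mu * genome_bp / 100.0)


if __name__ == "__main__":
    for end_bp in (269, 321):
        print(f"bp 1-{end_bp} -> m.u. 0-{bp_to_map_units(end_bp):.2f}")
```

Under this assumption, bp 269 and bp 321 round to 0.75 and
0.89 map units, in agreement with the values given for the
two crippled helper viruses.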

Generation of a poly-L-lysine conjugate helper
virus was demonstrated essentially as described in detail
in Example 5 below and Fig. 10 by coupling poly-L-lysine
to the Ad.CBhpAP virion capsid. Alternatively, the same
procedure may be employed with the PAC sequence modified
helper viruses of this invention.

C. Recombinant AdA Virus

As stated above, a pAdA shuttle vector in the
presence of helper virus and/or a packaging cell line
permits the adenovirus-transgene sequences in the shuttle
vector to be replicated and packaged into virion capsids,
resulting in the recombinant AdA virus. The current
method for producing such AdA virus is transfection-based
and described in detail in Example 3. Briefly, helper
virus is used to infect cells, such as the packaging cell
line human HEK 293, which are then subsequently
transfected with a pAdA shuttle vector containing a
selected transgene by conventional methods. About 30 or
more hours post-transfection, the cells are harvested,
and an extract prepared. The AdA viral genome is
packaged into virions that sediment at a lower density
than the helper virus in cesium gradients. Thus, the
recombinant AdA virus containing a selected transgene is
separated from the bulk of the helper virus by
purification via buoyant density ultracentrifugation in a
CsCl gradient.

The yield of AdA transducing virus is largely 
dependent on the number of cells that are transfected 
with the pAdA shuttle plasmid, making it desirable to use 
a transfection protocol with high efficiency. One such
method involves use of a poly-L-lysinylated helper
adenovirus as described above. A pAdA shuttle plasmid
containing the desired transgene under the control of a
suitable promoter, as described above, is then complexed
directly to the positively charged helper virus capsid,
resulting in the formation of a single transfection
particle containing the pAdA shuttle vector and the 
helper functions of the helper virus. 

The underlying principle is that the helper 
adenovirus coated with plasmid pAdA DNA will co-transport 
the attached nucleic acid across the cell membrane and 
into the cytoplasm according to its normal mechanism of 
cell entry. Therefore, the poly-L-lysine modified helper
adenovirus assumes multiple roles in the context of an
AdA-based complex. First, it is the structural
foundation upon which plasmid DNA can bind, increasing
the effective concentration. Second, receptor mediated
endocytosis of the virus provides the vehicle for cell
uptake of the plasmid DNA. Third, the endosomolytic
activity associated with adenoviral infection facilitates
the release of internalized plasmid into the cytoplasm.
Fourth, the adenovirus contributes trans helper functions
on which the recombinant AdA virus is dependent for
replication and packaging of transducing viral particles.
The AdA-based transfection procedure using a pAdA shuttle
vector and a polycation-helper conjugate is detailed in
Example 6. Additionally, as described previously, the
helper virus-plasmid conjugate may be another form of
helper virus delivery of the omitted adenovirus genes not
present in the pAdA vector. Such a structure enables the
rest of the required adenovirus genes to be divided
between the plasmid and the helper virus, thus reducing
the self-replication efficiency of the helper virus.
A presently preferred method of producing the
recombinant AdA virus of this invention involves
performing the above-described transfection with the
crippled helper virus or crippled helper virus conjugate,
as described above. A "crippled" helper virus of this
invention is unable to package itself efficiently, and
therefore permits ready separation of the helper virus
from the newly packaged AdA vector of this invention by
use of buoyant density ultracentrifugation in a CsCl
gradient, as described in the examples below.

IV. Function of the Recombinant AdA Virus

Once the AdA virus of this invention is produced by
cooperation of the shuttle vector and helper virus, the
AdA virus can be targeted to, and taken up by, a selected
target cell. The selection of the target cell also
depends upon the use of the recombinant virus, i.e.,
whether or not the transgene is to be replicated in vitro
or ex vivo for production in a desired cell type for
redelivery into a patient, or in vivo for delivery to a
particular cell type or tissue. Target cells may be any
mammalian cell (preferably a human cell). For example,
in in vivo use, the recombinant virus can target to any
cell type normally infected by adenovirus, depending upon
the route of administration, i.e., it can target, without
limitation, neurons, hepatocytes, epithelial cells and
the like. The helper adenovirus sequences supply the
sequences necessary to permit uptake of the virus by the
AdA.

Once the recombinant virus is taken up by a cell, 
the adenovirus flanked transgene is rescued from the 
parental adenovirus backbone by the machinery of the 
infected cell, as with other recombinant adenoviruses. 
Once uncoupled (rescued) from the genome of the AdA 
virus, the recombinant minigene seeks an integration site 
in the host chromatin and becomes integrated therein, 
either transiently or stably, providing expression of the 
accompanying transgene in the host cell. 

V. Use of the AdA Viruses in Gene Therapy

The novel recombinant viruses and viral conjugates 
of this invention provide efficient gene transfer 
vehicles for somatic gene therapy. These viruses are 
prepared to contain a therapeutic gene in place of the 
LacZ reporter transgene illustrated in the exemplary 
viruses and vectors. By use of the AdA viruses 
containing therapeutic transgenes, these transgenes can 
be delivered to a patient in vivo or ex vivo to provide
for integration of the desired gene into a target cell. 
Thus, these viruses can be employed to correct genetic 
deficiencies or defects. An example of the generation of 
an AdA gene transfer vehicle for the treatment of cystic 
fibrosis is described in Example 4 below. One of skill 
in the art can generate any number of other gene transfer 
vehicles by including a selected transgene for the 
treatment of other disorders.

The recombinant viruses of the present invention may
be administered to a patient, preferably suspended in a
biologically compatible solution or pharmaceutically
acceptable delivery vehicle. A suitable vehicle includes
sterile saline. Other aqueous and non-aqueous isotonic
sterile injection solutions and aqueous and non-aqueous
sterile suspensions known to be pharmaceutically
acceptable carriers and well known to those of skill in
the art may be employed for this purpose.

The recombinant viruses of this invention may be
administered in sufficient amounts to transfect the
desired cells and provide sufficient levels of
integration and expression of the selected transgene to
provide a therapeutic benefit without undue adverse
effects or with medically acceptable physiological
effects which can be determined by those skilled in the
medical arts. Conventional and pharmaceutically
acceptable parenteral routes of administration include
direct delivery to the target organ, tissue or site,
intranasal, intravenous, intramuscular, subcutaneous,
intradermal and oral administration. Routes of
administration may be combined, if desired.

Dosages of the recombinant virus will depend
primarily on factors such as the condition being treated,
the selected gene, the age, weight and health of the
patient, and may thus vary among patients. A
therapeutically effective human dosage of the viruses of
the present invention is believed to be in the range of
from about 20 to about 50 ml of saline solution
containing concentrations of from about 1 x 10^7 to 1 x
10^10 pfu/ml virus of the present invention. A preferred
human dosage is about 20 ml saline solution at the above
concentrations. The dosage will be adjusted to balance
the therapeutic benefit against any side effects. The
levels of expression of the selected gene can be
monitored to determine the selection, adjustment or
frequency of dosage administration.
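
For orientation only, the total number of plaque forming
units implied by a given volume and concentration is simply
their product. The sketch below works through the stated
ranges; the function name total_dose_pfu is hypothetical, and
the calculation is arithmetic on the figures above, not a
dosing recommendation.

```python
def total_dose_pfu(volume_ml, titer_pfu_per_ml):
    """Total plaque forming units in volume_ml at the given concentration."""
    return volume_ml * titer_pfu_per_ml


if __name__ == "__main__":
    # Bounds taken from the stated ranges: 20-50 ml at 1 x 10^7 to 1 x 10^10 pfu/ml.
    lowest = total_dose_pfu(20, 10**7)
    highest = total_dose_pfu(50, 10**10)
    print(f"lowest total dose:  {lowest:.1e} pfu")
    print(f"highest total dose: {highest:.1e} pfu")
```

On these figures the stated ranges correspond to roughly
2 x 10^8 to 5 x 10^11 pfu in total, and the preferred 20 ml
volume to between 2 x 10^8 and 2 x 10^11 pfu.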

The following examples illustrate the construction
of the pAdA shuttle vectors, helper viruses and 
recombinant AdA viruses of the present invention and the 
use thereof in gene therapy. These examples are 
illustrative only, and do not limit the scope of the 
present invention. 

Example 1 - Construction of pAdA.CMVLacZ and pAdAc.CMVLacZ

A. pAdA.CMVLacZ

A human adenovirus Ad5 sequence was modified to
contain a deletion in the E1a region [map units 1 to
9.2], which immediately follows the Ad 5' region (bp 1-
360) (illustrated in Fig. 1A). Thus, the plasmid
contains the 5' ITR sequence (bp 1-103), the native
packaging/enhancer sequences and the TATA box for the E1a
region (bp 104-360). A minigene containing the CMV
immediate early enhancer/promoter, an SD/SA sequence, a
cytoplasmic lacZ gene, and SV40 poly A (pA), was
introduced at the site of the E1a deletion. This
construct was further modified so that the minigene is
followed by the 3' ITR sequences (bp 35,353-end). The
DNA sequences for these components are provided in Fig. 3
and SEQ ID NO: 1 (see also the brief description of this
figure).

This construct was then cloned by conventional
techniques into a pSP72 vector (Promega) backbone to make
the circular shuttle vector pAdA.CMVLacZ. See the
schematic of Fig. 2A. This construct was engineered with
EcoRI sites flanking the 5' and 3' Ad5 ITR sequences.
pAdA.CMVLacZ was then subjected to enzymatic digestion
with EcoRI, releasing from the plasmid backbone a linear
fragment of the vector spanning the terminal end of the
Ad 5' ITR sequence through the terminal end of the 3' ITR
sequence. See Fig. 2B.

B. pAdAc.CMVLacZ

The shuttle vector pAdAc.CMVLacZ (Figs. 4A and
5) was constructed using a pSP72 (Promega) backbone so
that the Ad5 5' ITR and 3' ITR were positioned head-to-
tail. The organization of the Ad5 ITRs was based on
reports that suggest circular Ad genomes that have the
terminal ends fused together head-to-tail are infectious
to levels comparable to linear Ad genomes. A minigene
encoding the CMV enhancer, an SD/SA sequence, the LacZ
gene, and the poly A sequence was inserted immediately
following the 5' ITR. The DNA sequence of the resulting
plasmid and the sequences for the individual components
are reported in Fig. 5 and SEQ ID NO: 2 (see also the
brief description of Fig. 5). This plasmid does not
require enzymatic digestion prior to its use to produce
the viral particle (see Example 3). This vector was
designed to enable restriction-independent production of
LacZ AdA vectors.


Example 2 - Construction of a Helper Virus

The Ad.CBhpAP helper virus [K. Kozarsky et al, Som.
Cell Mol. Genet., 19(5):449-458 (1993)] is a replication
deficient adenovirus containing an alkaline phosphatase
minigene. Its construction involved conventional cloning
and homologous recombination techniques. The adenovirus
DNA substrate was extracted from CsCl purified dl7001
virions, an Ad5 (serotype subgroup C) variant that
carries a 3 kb deletion between mu 78.4 and 86 in the
nonessential E3 region (provided by Dr. William Wold,
Washington University, St. Louis, Missouri). Viral DNA
was prepared for co-transfection by digestion with ClaI
(adenovirus genomic bp position 917) which removes the
left arm of the genome encompassing adenovirus map units
0-2.5. See lower diagram of Fig. 1B.

A parental cloning vector, pAd.BglII, was designed.
It contains two segments of wild-type Ad5 genome (i.e.,
map units 0-1 and 9-16.1) separated by a unique BglII
cloning site for insertion of heterologous sequences.
The missing Ad5 sequences between the two domains
(adenovirus genome bp 361-3327) result in the deletion
of E1a and the majority of E1b following recombination
with viral DNA.

A recombinant hpAP minigene was designed and
inserted into the BglII site of pAd.BglII to generate the
complementing plasmid, pAdCBhpAP. The linear arrangement
of this minigene includes:

(a) the chicken cytoplasmic β-actin promoter
[nucleotides +1 to +275 as described in T. A. Kost et al,
Nucl. Acids Res., 11(23):8287 (1983); nucleotides 9241-
8684 of Fig. 7];

(b) an SV40 intron (e.g., nucleotides 1579-1711 of
SEQ ID NO: 2),

(c) the sequence for human placental alkaline
phosphatase (available from Genbank) and

(d) an SV40 polyadenylation signal (a 237 bp BamHI-
BclI restriction fragment containing the cleavage/poly-A
signals from both the early and late transcription units;
e.g., nucleotides 837-639 of SEQ ID NO: 1).

The resulting complementing plasmid, pAdCBhpAP,
contained a single copy of recombinant hpAP minigene
flanked by adenovirus coordinates 0-1 on one side and
9.2-16.1 on the other.

Plasmid DNA was linearized using a unique NheI site
immediately 5' to adenovirus map unit zero (0) and the
above-identified adenovirus substrate and the
complementing plasmid DNAs were transfected to 293 cells
[ATCC CRL1573] using a standard calcium phosphate
transfection procedure [see, e.g., Sambrook et al, cited
above]. The end result of homologous recombination
involving sequences that map to adenovirus map units 9-
16.1 is hybrid Ad.CBhpAP helper virus, which contains
adenovirus map units 0-1 and, in place of the E1a and E1b
coding regions from the dl7001 adenovirus substrate, the
hpAP minigene from the plasmid, followed by Ad
sequences 9 to 100, with a deletion in the E3 (78.4-86
mu) region.

Example 3 - Production of Recombinant AdA Virus

The recombinant AdA viruses of this invention are
generated by co-transfection of a shuttle vector with the
helper virus in a selected packaging or non-packaging
cell line.

As described in detail below, the linear fragment
provided in Example 1A, or the circular AdA genome
carrying the LacZ of Example 1B, is packaged into the
Ad.CBhpAP helper virus (Example 2) using conventional
techniques, which provides an empty capsid head, as
illustrated in Fig. 2C. Those virus particles which have
successfully taken up the pAdA shuttle genome into the
capsid head can be distinguished from those containing
the hpAP gene by virtue of the differential expression of
LacZ and hpAP.

In more detail, 293 cells (4 x 10^7 pfu 293 cells/150
mm dish) were seeded and infected with helper virus
Ad.CBhpAP (produced as described in Example 2) at an MOI
of 5 in 20 ml DMEM/2% fetal bovine serum (FBS). The
helper specific hpAP marker is critical for monitoring the
level of helper virus contamination in AdA preparations
before and after purification. The helper virus provides
in trans the necessary helper functions for synthesis and
packaging of the AdA.CMVLacZ genome.
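
As a small worked illustration of the infection step, the
amount of helper virus needed follows from the multiplicity of
infection (MOI), i.e., infectious units added per cell. The
sketch below assumes that the figure of 4 x 10^7 refers to the
number of 293 cells per 150 mm dish, which is an
interpretation of the text; the function name
helper_pfu_required is hypothetical.

```python
def helper_pfu_required(cell_count, moi):
    """Plaque forming units of helper virus needed to infect cell_count cells at moi."""
    return cell_count * moi


if __name__ == "__main__":
    # Assumption: 4 x 10^7 cells per 150 mm dish, infected at an MOI of 5.
    cells_per_dish = 4 * 10**7
    per_dish = helper_pfu_required(cells_per_dish, 5)
    print(f"helper virus per dish: {per_dish:.1e} pfu")
    # A 50 plate preparation would scale linearly.
    print(f"helper virus per 50 dishes: {50 * per_dish:.1e} pfu")
```

Under that reading, each dish receives about 2 x 10^8 pfu of
helper virus.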

Two hours post infection, using either the
restriction-dependent shuttle vector or the restriction-
independent shuttle vector, plasmid pAdA.CMVLacZ
(digested with EcoRI) or pAdAc.CMVLacZ DNA, each carrying
a LacZ minigene, was added to the cells by a calcium
phosphate precipitate (2.5 ml calcium phosphate
transfection cocktail containing 50 ng plasmid DNA).

Thirty to forty hours post-transfection, cells were
harvested, suspended in 10 mM Tris-Cl (pH 8.0) (0.5
ml/150 mm plate) and frozen at -80°C. Frozen cell
suspensions were subjected to three rounds of freeze
(ethanol-dry ice)-thaw (37°C) cycles to release virion
capsids. Cell debris was removed by centrifugation
(5,000 x g for 10 minutes) and the clarified supernatant
applied to a CsCl gradient to separate recombinant virus
from helper virus as follows.

Supernatants (10 ml) applied to the discontinuous
CsCl gradient (composed of equal volumes of CsCl at 1.2
g/ml, 1.36 g/ml, and 1.45 g/ml in 10 mM Tris-Cl (pH 8.0))
were centrifuged for 8 hours at 72,128 x g, resulting in
separation of infectious helper virus from incompletely
formed virions. Fractions were collected from the
interfacing zone between the helper and top components
and analyzed by Southern blot hybridization or for the
presence of LacZ transducing particles. For functional
analysis, aliquots (2.0 μl from each sample) from the
same fractions were added to monolayers of 293 cells (in
35 mm wells) and expression of recombinant
β-galactosidase was determined 24 hours later. More
specifically, monolayers were harvested, suspended in 0.3
ml 10 mM Tris-Cl (pH 8.0) buffer and an extract prepared
by three rounds of freeze-thaw cycles. Cell debris was
removed by centrifugation and the supernatant tested for
β-galactosidase (LacZ) activity according to the
procedure described in J. Price et al, Proc. Natl. Acad.
Sci. USA, 84:156-160 (1987). The specific activity
(milliunits β-galactosidase/mg protein) of reporter
enzymes was measured from indicator cells. For the
recombinant virus, specific activity was 116.
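
The specific activity reported above is a ratio of enzyme activity to total protein in the extract. A minimal sketch of that arithmetic follows; the function name and the input values are hypothetical and were chosen only so that the result reproduces the reported figure of 116.

```python
def specific_activity(activity_milliunits: float, protein_mg: float) -> float:
    """Return specific activity in milliunits of enzyme per mg of protein."""
    if protein_mg <= 0:
        raise ValueError("protein_mg must be positive")
    return activity_milliunits / protein_mg


# Hypothetical inputs (not from the specification): 58 mU of
# beta-galactosidase measured in an extract containing 0.5 mg protein.
example = specific_activity(58.0, 0.5)
print(example)
```

Normalizing to protein lets extracts from wells with different cell numbers be compared on the same scale.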

Fractions with β-galactosidase activity from the
discontinuous gradient were sedimented through an
equilibrium cesium gradient to further enrich the
preparation for AdA virus. A linear gradient was
generated in the area of the recombinant virus spanning
densities 1.29 to 1.34 g/ml. A sharp peak of the
recombinant virus, detected as the appearance of
β-gal activity in infected 293 cells, eluted between 1.31
and 1.33 g/ml. This peak of recombinant virus was
located between two major A260 nm absorbing peaks and in
an area of the gradient where the helper virus was
precipitously dropping off. The equilibrium
sedimentation gradient accomplished another 10^2 to 10^3
fold purification of recombinant virus from helper virus.
The yield of recombinant AdA.CMVLacZ virus recovered from
a 50 plate prep after 2 sedimentations ranged from 10^7 to
10^8 transducing particles.
Analysis of lysates of cells transfected with the
recombinant vector and infected with helper revealed
virions capable of transducing the recombinant minigene
contained within the vector. Subjecting aliquots of the
fractions to Southern analysis using probes specific to
the recombinant virus or helper virus revealed packaging
of multiple molecular forms of vector derived sequence.
The predominant form of the deleted viral genome was the
size (~5.5 kb) of the corresponding double stranded DNA
monomer (AdA.CMVLacZ) with less abundant but discrete
higher molecular weight species (~10 kb and ~15 kb) also
present. Full-length helper virus is 35 kb. Importantly,
the peak of vector transduction activity corresponds with
the highest molecular weight form of the deleted virus.
These results confirm the hypothesis that ITRs and
contiguous packaging sequences are the only elements
necessary for incorporation into virions. An apparently
ordered or preferred rearrangement of the recombinant Ad
monomer genome leads to a more biologically active
molecule. The fact that the larger molecular species of
the deleted genome are 2x and 3x larger than the monomer
deleted virus genome suggests that the rearrangements may
involve sequential duplication of the original genome.
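
The size argument above can be checked by asking which whole-number multiple of the ~5.5 kb monomer lies closest to each band observed on the Southern blot. The following is only an illustrative sketch; the helper name is hypothetical, and the sizes are the approximate values quoted in the text.

```python
def nearest_multiple(size_kb: float, monomer_kb: float) -> int:
    """Return the whole-number multiple of the monomer closest to size_kb."""
    if monomer_kb <= 0:
        raise ValueError("monomer_kb must be positive")
    return max(1, round(size_kb / monomer_kb))


MONOMER_KB = 5.5  # approximate monomer size quoted in the text

# Approximate band sizes quoted in the text (kb).
multiples = {size: nearest_multiple(size, MONOMER_KB) for size in (5.5, 10.0, 15.0)}
print(multiples)
```

Because the band sizes are approximate, the multiples are rounded; they are consistent with, but do not prove, dimer and trimer forms.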

These same procedures may be adapted for production
of a recombinant AdA virus using a crippled helper virus
or helper virus conjugate as described previously.

Example 4 - Recombinant AdA Virus Containing a
Therapeutic Minigene

To test the versatility of the recombinant AdA virus
system, the reporter LacZ minigene obtained from
pAdA.CMVLacZ was cassette replaced with a therapeutic
minigene encoding CFTR.

The minigene contained human CFTR cDNA [Riordan et
al, Science, 245:1066-1073 (1989); nucleotides 8622-4065
of SEQ ID NO: 3] under the transcriptional control of a
chimeric CMV enhancer/chicken β-actin promoter element
(nucleotides +1 to +275 as described in T. A. Kost et al,
Nucl. Acids Res., 11(23):8287 (1983); nucleotides 9241-
8684 of SEQ ID NO: 3, Fig. 7); and followed by an SV-40
poly-A sequence (nucleotides 3887-3684 of SEQ ID NO: 3,
Fig. 7).

The CFTR minigene was inserted into the E1 deletion
site of an Ad5 virus (called pAd.E1A) which contains a
deletion in E1a from mu 1-9.2 and a deletion in E3 from
mu 78.4-86.

The resulting shuttle vector called pAdA.CBCFTR (see
Figs. 6 and the DNA sequence of Fig. 7 [SEQ ID NO: 3])
used the same Ad ITRs of pAdA.CMVLacZ, but the Ad5
sequences terminated with NheI sites instead of EcoRI.
Therefore release of the minigene from the plasmid was
accomplished by digestion with NheI.

The vector production system described in Example 3
was employed, using the helper virus Ad.CBhpAP (Example
2). Monolayers of 293 cells grown to 80-90% confluency
in 150 mm culture dishes were infected with the helper
virus at an MOI of 5. Infections were done in DMEM
supplemented with 2% FBS at 20 ml media/150 mm plate.
Two hours post-infection, 50 μg plasmid DNA in 2.5 ml
transfection cocktail was added to each plate and evenly
distributed.
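
The amount of helper virus needed for an infection at a given MOI follows directly from the number of cells on the plate. The sketch below is illustrative only: the function name is hypothetical, and the figure of 4 x 10^7 cells per 150 mm dish is an assumption borrowed from the seeding density quoted in Example 7, not a value stated for this step.

```python
def pfu_required(moi: float, cells_per_plate: float, plates: int = 1) -> float:
    """Return the infectious units needed to infect `plates` plates at `moi`."""
    if moi < 0 or cells_per_plate < 0 or plates < 0:
        raise ValueError("inputs must be non-negative")
    return moi * cells_per_plate * plates


# Assumed cell number per 150 mm dish (taken from Example 7's seeding density).
CELLS_PER_150MM_DISH = 4e7

per_plate = pfu_required(5, CELLS_PER_150MM_DISH)
per_50_plate_prep = pfu_required(5, CELLS_PER_150MM_DISH, plates=50)
print(per_plate, per_50_plate_prep)
```

Under that assumption, an MOI of 5 corresponds to 2 x 10^8 pfu of helper virus per dish.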

Delivery of the pAdA.CBCFTR plasmid to 293 cells was
mediated by formation of a calcium phosphate precipitate
and AdA.CBCFTR virus resolved from Ad.CBhpAP helper virus
by CsCl buoyant density ultracentrifugation as follows:

Cells were left in this condition for 10-14 h,
after which the infection/transfection media was replaced
with 20 ml fresh DMEM/2% FBS. Approximately 30 h post-
transfection, cells were harvested, suspended in 10 mM
Tris-Cl (pH 8.0) buffer (0.5 ml/150 mm plate), and stored
at -80°C.

Frozen cell suspensions were lysed by three
sequential rounds of freeze (ethanol-dry ice)-thaw
(37°C). Cell debris was removed by centrifugation (5,000
x g for 10 min) and 10 ml clarified extract layered onto
a CsCl step gradient composed of three 9.0 ml tiers with
densities 1.45 g/ml, 1.36 g/ml, and 1.20 g/ml CsCl in 10
mM Tris-Cl (pH 8.0) buffer. Centrifugation was performed
at 20,000 rpm in a Beckman SW-28 rotor for 8 h at 4°C.
Fractions (1.0 ml) were collected from the bottom of the
centrifuge tube and analyzed for AdA transducing
vectors. Peak fractions were combined and banded to
equilibrium. Fractions containing transducing virions
were dialyzed against 20 mM HEPES (pH 7.8)/150 mM NaCl
(HBS) and stored frozen at -80°C in the presence of 10%
glycerol or as a liquid stock at -20°C (HBS+40%
glycerol).
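
The rotor speed given here (20,000 rpm in an SW-28 rotor) can be related to the force quoted in Example 3 (72,128 x g) with the standard conversion RCF = 1.118 x 10^-5 x r x N^2, where r is the radius in cm and N is the speed in rpm. The sketch below is a rough check only; the 16.1 cm maximum radius is an assumption about the rotor geometry and is not stated in this specification.

```python
def rcf_from_rpm(rpm: float, radius_cm: float) -> float:
    """Return relative centrifugal force (x g) for a speed and rotor radius."""
    return 1.118e-5 * radius_cm * rpm ** 2


# Assumed maximum radius of the swinging-bucket rotor, in cm (not from the text).
SW28_RMAX_CM = 16.1

rcf = rcf_from_rpm(20_000, SW28_RMAX_CM)
print(round(rcf))
```

With that assumed radius the result falls within about 1% of the 72,128 x g figure, which suggests the two examples describe the same centrifugation conditions.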

Fractions collected after ultracentrifugation were
analyzed for transgene expression and vector DNA. For
lacZ AdA vectors, 2 μl aliquots were added to 293 cell
monolayers seeded in 35 mm culture wells. Twenty-four
hours later cells were harvested, suspended in 0.3 ml 10
mM Tris-Cl (pH 8.0) buffer, and lysed by three rounds of
freeze-thaw. Cell debris was removed by centrifugation
(15,000 x g for 10 min) and the supernatant assayed for
total protein [Bradford, (1976)] and β-galactosidase
activity [Sambrook et al, (1989)] using ONPG
(o-nitrophenyl β-D-galactopyranoside) as substrate.

Expression of CFTR protein from the AdA.CBCFTR
vector was determined by immunofluorescence localization.
Aliquots of AdA.CBCFTR, enriched by two rounds of
ultracentrifugation and exchanged to HBS storage buffer,
were added to primary cultures of airway epithelial cells
obtained from the lungs of CF transplant recipients.
Twenty-four hours after the addition of vector, cells
were harvested and affixed to glass slides using
centrifugal force (Cytospin 3, Shandon Scientific
Limited). Cells were fixed with freshly prepared 3%
paraformaldehyde in PBS (1.4 mM KH2PO4, 4.3 mM Na2HPO4,
2.7 mM KCl, and 137 mM NaCl) for 15 min at room
temperature (RT), washed twice in PBS, and permeabilized
with 0.05% NP-40 for 10 min at RT. The
immunofluorescence procedure began with a blocking step
in 10% goat serum (PBS/GS) for 1 h at RT, followed by
binding of the primary monoclonal mouse anti-human CFTR
(R-domain specific) antibody (Genzyme) diluted 1:500 in
PBS/GS for 2 h at RT. Cells were washed extensively in
PBS/GS and incubated for 1 h at RT with a donkey anti-
mouse IgG (H+L) FITC conjugated
antibody (Jackson ImmunoResearch Laboratories) diluted
1:100 in PBS/GS.

For Southern analysis of vector DNA, 5 μl aliquots
were taken directly from CsCl fractions and incubated
with 20 μl capsid digestion buffer (50 mM Tris-Cl, pH
8.0; 1.0 mM EDTA, pH 8.0; 0.5% SDS, and 1.0 mg/ml
Proteinase K) at 50°C for 1 h. The reactions were
allowed to cool to RT, loading dye was added, and the
samples were electrophoresed through a 1.2% agarose gel.
Resolved DNAs were electroblotted onto a nylon membrane
(Hybond-N) and hybridized with a 32P-labeled restriction
fragment. Blots were analyzed by autoradiography or
scanned on a Phosphorimager 445 SI (Molecular Dynamics).

Southern blot analysis of gradient fractions
revealed a distinct viral band that migrated faster than
the helper Ad.CBhpAP DNA. The highest viral titers mapped
to fractions 3 and 4. Quantitation of the bands in
fraction 4 indicated the titer of Ad.CBhpAP was
approximately 1.5x greater than AdA.CBCFTR. However, if
the size difference between the two viruses is factored
in (Ad.CBhpAP = 35 kb; AdA.CBCFTR = 6.2 kb), the viral
titer (where 1 particle = 1 DNA molecule) of AdA.CBCFTR
is at least 4-fold greater than the viral titer of
Ad.CBhpAP.
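
The size correction described above can be written out explicitly. The sketch assumes that the Southern band intensity is proportional to the mass of DNA in the band, so that a shorter genome contributes proportionally more molecules per unit of signal; the function name is hypothetical.

```python
def vector_to_helper_particle_ratio(
    helper_over_vector_signal: float, helper_kb: float, vector_kb: float
) -> float:
    """Return vector:helper particle ratio from a DNA-mass signal ratio.

    Assumes signal is proportional to DNA mass and one genome per particle.
    """
    if min(helper_over_vector_signal, helper_kb, vector_kb) <= 0:
        raise ValueError("all inputs must be positive")
    # Equal DNA mass of a shorter genome means proportionally more molecules.
    return (helper_kb / vector_kb) / helper_over_vector_signal


ratio = vector_to_helper_particle_ratio(1.5, helper_kb=35.0, vector_kb=6.2)
print(round(ratio, 2))
```

With the rounded figures quoted in the text (1.5x, 35 kb and 6.2 kb), the ratio comes out a little under 4, that is, on the order of the 4-fold excess of AdA.CBCFTR particles described above.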

While Southern blot analysis of gradient fractions
was useful for showing the production of AdA viral
particles, it also demonstrated the utility of
ultracentrifugation for purifying AdA viruses.
Considering the latter of these, both LacZ and CFTR
transducing viruses banded in CsCl to an intermediate
density between infectious adenovirus helper virions
(1.34 g/ml) and incompletely formed capsids (1.31 g/ml).
The lighter density relative to helper virus likely
results from the smaller genome carried by the AdA
viruses. This further suggests that changes in virus size
influence the density and purification of AdA virus.
Regardless, the ability to separate AdA virus from the
helper virus is an important observation and suggests
further purification may be achieved by successive rounds
of banding through CsCl.

This recombinant virus is useful in gene therapy 
alone, or preferably, in the form of a conjugate prepared 
as described herein. 

Example 5 - Correction of Genetic Defect in CF Airway
Epithelial Cells with AdA.CBCFTR

Treatment of cystic fibrosis, utilizing the
recombinant virus provided above, is particularly suited
for in vivo, lung-directed gene therapy. Airway
epithelial cells are the most desirable targets for gene
transfer because the pulmonary complications of CF are
usually its most morbid and life-limiting.

The recombinant AdA.CBCFTR virus was fractionated on
sequential CsCl gradients, and fractions containing CFTR
sequences, migrating between the adenovirus and top
components fractions described above, were used to infect
primary cultures of human airway epithelial cells derived
from the lungs of a CF patient. The cultures were
subsequently analyzed for expression of CFTR protein by
immunocytochemistry. Immunofluorescent detection with
mouse anti-human CFTR (R domain specific) antibody was
performed 24 hours after the addition of the recombinant
virus. Analysis of mock infected CF cells failed to
reveal significant binding to the R domain specific CFTR
antibody. Primary airway epithelium cultures exposed to
the recombinant virus demonstrated high levels of CFTR
protein in 10-20% of the cells.

Thus, the recombinant virus of the invention,
containing the CFTR gene, may be delivered directly into
the airway, e.g. by formulating the virus above into a
preparation which can be inhaled. For example, the
recombinant virus or conjugate of the invention
containing the CFTR gene is suspended in 0.25 molar
sodium chloride. The virus or conjugate is taken up by
respiratory airway cells and the gene is expressed.

Alternatively, the virus or conjugates of the
invention may be delivered by other suitable means,
including site-directed injection of the virus bearing
the CFTR gene. In the case of CFTR gene delivery,
preferred solutions for bronchial instillation are
sterile saline solutions containing in the range of from
about 1 x 10^7 to 1 x 10^10 pfu/ml, more particularly, in
the range of from about 1 x 10^8 to 1 x 10^9 pfu/ml of the
virus of the present invention.

Other suitable methods for the treatment of cystic
fibrosis by use of gene therapy recombinant viruses of
this invention may be obtained from the art discussions
of other types of gene therapy vectors for CF. See, for
example, U. S. Patent No. 5,240,846, incorporated by
reference herein.

Example 6 - Synthesis of Polycation Helper Virus
Conjugate

Another version of the helper virus of this
invention is a polylysine conjugate which enables the
pAdA shuttle plasmid to complex directly with the helper
virus capsid. This conjugate permits efficient delivery
of the pAdA shuttle vector in tandem with the
helper virus, thereby removing the need for a separate
transfection step. See Fig. 10 for a diagrammatic
outline of this construction. Alternatively, such a
conjugate with a plasmid supplying some Ad genes and the
helper supplying the remaining necessary genes for
production of the AdA viral vector provides a novel way
to reduce contamination of the helper virus, as discussed
above.

Purified stocks of a large-scale expansion of
Ad.CBhpAP were modified by coupling poly-L-lysine to the
virion capsid essentially as described by K. J. Fisher
and J. M. Wilson, Biochem. J., 299:49-58 (1994),
resulting in an Ad.CBhpAP-(Lys)n conjugate. The
procedure involves three steps.

First, CsCl band purified helper virus Ad.CBhpAP was
reacted with the heterobifunctional crosslinker sulfo-
SMCC [sulfo-(N-succinimidyl-4-(N-maleimidomethyl)
cyclohexane-1-carboxylate)] (Pierce). The conjugation
reaction, which contained 0.5 mg (375 nmol) of sulfo-
SMCC and 6 x 10^11 A260 helper virus particles in 3.0 ml of
HBS, was incubated at 30°C for 45 minutes with constant
gentle shaking. This step involved formation of a
peptide bond between the active N-hydroxysuccinimide
(NHS) ester of sulfo-SMCC and a free amine (e.g. lysine)
contributed by an adenovirus protein sequence (capsid
protein) in the vector, yielding a maleimide-activated
viral particle. The activated adenovirus is shown in
Fig. 10 having the capsid protein fiber labeled with the
nucleophilic maleimide moiety. In practice, other capsid
polypeptides including hexon and penton base are also
targeted.

Unincorporated, unreacted cross-linker was removed
by gel filtration on a 1 cm x 15 cm Bio-Gel P-6DG (Bio-
Rad Laboratories) column equilibrated with 50 mM Tris/HCl
buffer, pH 7.0, and 150 mM NaCl. Peak A260 fractions
containing maleimide-activated helper virus were combined
and placed on ice.

Second, poly-L-lysine having a molecular mass of 58
kDa at 10 mg/ml in 50 mM triethanolamine buffer (pH 8.0),
150 mM NaCl and 1 mM EDTA was thiolated with 2-
iminothiolane/HCl (Traut's Reagent; Pierce) to a molar
ratio of 2 moles -SH/mole polylysine under N2; the cyclic
thioimidate reacts with the poly(L-lysine) primary amines
resulting in a thiolated polycation. After a 45 minute
incubation at room temperature the reaction was applied
to a 1 cm x 15 cm Bio-Gel P-6DG column equilibrated with
50 mM Tris/HCl buffer (pH 7.0), 150 mM NaCl and 2 mM EDTA
to remove unincorporated Traut's Reagent.

Quantification of free thiol groups was accomplished
with Ellman's reagent [5,5'-dithio-bis-(2-nitrobenzoic
acid)], revealing approximately 3-4 mol of -SH/mol of
poly(L-lysine). The coupling reaction was initiated by
adding 1 x 10^12 A260 particles of maleimide-activated
helper virus/mg of thiolated poly(L-lysine) and
incubating the mixture on ice at 4°C for 15 hours under
argon. 2-Mercaptoethylamine was added at the completion
of the reaction and incubation carried out at room
temperature for 20 minutes to block unreacted maleimide
sites.

Virus-polylysine conjugates, Ad.CPAP-p(Lys)n, were
purified away from unconjugated poly(L-lysine) by
ultracentrifugation through a CsCl step gradient with an
initial composition of equal volumes of 1.45 g/ml (bottom
step) and 1.2 g/ml (top step) CsCl in 10 mM Tris/HCl
buffer (pH 8.0). Centrifugation was at 90,000 x g for 2
hours at 5°C. The final product was dialyzed against 20
mM HEPES buffer (pH 7.8) containing 150 mM NaCl (HBS).

Example 7 - Formation of AdA/helper-pLys Viral Particle

The formation of the Ad.CBhpAP-pLys/pAdA.CMVLacZ
particle is initiated by adding 20 μg plasmid
pAdA.CMVLacZ DNA to 1.2 x 10^12 A260 particles Ad.CBhpAP-
pLys in a final volume of 0.2 ml DMEM and allowing the
complex to develop at room temperature for 10-15
minutes. This ratio typically represents the plasmid DNA
binding capacity of a standard lot of adenovirus-pLys
conjugate and gives the highest levels of plasmid
transgene expression.

The resulting trans-infection particle is
transfected onto 293 cells (4 x 10^7 cells seeded on a 150
mm dish). Thirty hours after transfection, the particles
are recovered and subjected to a freeze/thaw technique to
obtain an extract. The extract is purified on a CsCl
step gradient with steps at 1.20 g/ml, 1.36 g/ml and
1.45 g/ml. After centrifugation at 90,000 x g for 8
hours, the AdA vectors were obtained from a fraction
under the top components as identified by the presence of
LacZ, and the helper virus was obtained from a smaller,
denser fraction, as identified by the presence of hpAP.

Example 8 - Construction of Modified Helper Viruses with
Crippled Packaging (PAC) Sequences

This example refers to Figs. 9A through 9C, 10A and
10B in the design of modified helper viruses of this
invention.

Ad5 5' terminal sequences that contained PAC domains
I and II (Fig. 8A) or PAC domains I, II, III, and IV
(Fig. 8B) were generated by PCR from the wild type Ad5 5'
genome depicted in Fig. 1B using PCR clones indicated by
the arrows in Fig. 1B. The sequences of the resulting
amplification products (Fig. 8A and 8B) differed from the
wild-type Ad5 genome in the number of A-repeats carried
by the left (5') end.

As depicted in Fig. 8C, these amplification products
were subcloned into the multiple cloning site of
pAd.Link.1 (IHGT Vector Core). pAd.Link.1 is an
adenovirus based plasmid containing adenovirus m.u. 9.6
through 16.1. The insertion of the modified PAC regions
into pAd.Link.1 generated two vectors, pAd.PACII
(containing PAC domains I and II) and pAd.PACIV
(containing PAC domains I, II, III, and IV).
Thereafter, as depicted in Figs. 10A and 10B, for
each of these plasmids, a human placenta alkaline
phosphatase reporter minigene containing the immediate
early CMV enhancer/promoter (CMV), human placenta
alkaline phosphatase cDNA (hpAP), and SV40
polyadenylation signal (pA), was subcloned into each PAC
vector, generating pAd.PACII.CMVhpAP and
pAd.PACIV.CMVhpAP, respectively.

These plasmids were then used as substrates for
homologous recombination with dl7001 virus, described
above, by co-transfection into 293 cells. Homologous
recombination occurred between the adenovirus map units
9-16 of the plasmid and the crippled Ad5 virus. The
results of homologous recombination were helper viruses
containing Ad5 5' terminal sequences that contained PAC
domains I and II or PAC domains I, II, III, and IV,
followed by the minigene, and Ad5 3' sequences 9.6-78.3
and 87-100. Thus, these crippled viruses are deleted of
the E1 gene and the E3 gene.

The plaque formation characteristics of the PAC
helper viruses gave an immediate indication that the PAC
modifications diminished the rate and extent of growth.
Specifically, PAC helper virus plaques did not develop
until day 14-21 post-transfection, and on maturation
remained small. From previous experience, plaques of a
standard first generation Ad.CBhpAP helper virus with a
complete left terminal sequence would begin to develop by
day 7 and mature by day 10.

Viral plaques were picked and suspended in 0.5 ml of
DMEM media. A small aliquot of the virus stock was used
to infect a fresh monolayer of 293 cells, which was
histochemically stained for recombinant alkaline
phosphatase activity 24 hours post-infection. Six of
eight Ad.PACIV.CMVhpAP (encodes A-repeats I-IV) clones
that were screened for transgene expression were
positive, while all three Ad.PACII.CMVhpAP clones that
were selected scored positive. The clones have been
taken through two rounds of plaque purification and are
currently being expanded to generate a working stock.

These crippled helper viruses are useful in the
production of the AdA virus particles according to the
procedures described in Example 3. They are
characterized by containing sufficient adenovirus genes
to permit the packaging of the shuttle vector genome, but
their crippled PAC sequences reduce their efficiency for
self-encapsidation. Thus fewer helper viruses are
produced in favor of more AdA recombinant viruses.
Purification of AdA virus particles from helper viruses
is facilitated in the CsCl gradient, which is based on
the weight of the respective viral particles. This
facility in purification is a decided advantage of the
AdA vectors of this invention in contrast to adenovirus
vectors having only E1 or smaller deletions. The AdA
vectors, even with minigenes of up to about 15 kb, are
significantly different in weight than wild type or other
adenovirus helpers containing many adenovirus genes.

Example 9 - AdA Vector Containing a Full-Length
Dystrophin Transgene

Duchenne muscular dystrophy (DMD) is a common X-
linked genetic disease caused by the absence of
dystrophin, a 427K protein encoded by a 14 kilobase
transcript. Lack of this important sarcolemmal protein
leads to progressive muscle wasting, weakness, and death.
One current approach for treating this lethal disease is
to transfer a functional copy of the dystrophin gene into
the affected muscles. For skeletal muscle, a
replication-defective adenovirus represents an efficient
delivery system.


According to the present invention, a recombinant
plasmid pAdA.CMVmdys was created which contains only the
Ad5 cis-elements (i.e., ITRs and contiguous packaging
sequences) and harbors the full-length murine dystrophin
gene driven by the CMV promoter. This plasmid was
generated as follows.

pSL1180 [Pharmacia Biotech] was cut with Not I,
filled in by Klenow, and religated, thus ablating the Not
I site in the plasmid. The resulting plasmid is termed
pSL1180NN and carries a bacterial ori and Amp resistance
gene.

pAdA.CMVLacZ of Example 1 was cut with EcoRI,
Klenowed, and ligated with the ApaI-cut pSL1180NN to form
pAdA.CMVLacZ (ApaI).

The 14 kb mouse dystrophin cDNA [sequences
provided in C. C. Lee et al, Nature, 349:334-336 (1991)]
was cloned in two large fragments using a lambda ZAP
cloning vector (Stratagene) and subsequently cloned into
the bluescript vector pSK-, giving rise to the plasmid
pCCL-DMD. A schematic diagram of this vector is provided
in Fig. 11, which illustrates the restriction enzyme
sites.

pAdA.CMVLacZ (ApaI) was cut with NotI and the
large fragment gel isolated away from the lacZ cDNA.
pCCL-DMD was also cut with NotI, gel isolated and
subsequently ligated to the large NotI fragment of NotI
digested pAdA.CMVLacZ (ApaI). The sequences of the
resulting vector, pAdA.CMVmdys, are provided in Fig.
12A-12P [SEQ ID NO: 10].

This plasmid contains sequences from the left
end of the Ad5 encompassing bp 1-360 (5' ITR), a mouse
dystrophin minigene under the control of the CMV
promoter, and sequence from the right end of Ad5 spanning
bp 35353 to the end of the genome (3' ITR). The minigene
is followed by an SV-40 poly-A sequence similar to that
described for the plasmids described above.

The vector production system described herein is
employed. Ten 150 mm 293 plates are infected at about 90%
confluency with a reporter recombinant E1-deleted virus
Ad.CBhpAP at an MOI of 5 for 60 minutes at 37°C. These
cells are transfected with pAdA.CMVmDys by calcium
phosphate co-precipitation using 50 μg linearized
DNA/dish for about 12-16 hours at 37°C. Media is
replaced with DMEM + 10% fetal bovine serum.

Full cytopathic effect is observed and a cell lysate
is made by subjecting the cell pellet to freeze-thaw
procedures three times. The lysate is subjected to an
SW41 three tier CsCl gradient for 2 hours and a band
migrating between the helper adenovirus and incomplete
virus is detected.

Fractions are assayed on a 6 well plate containing
293 cells infected with 5 μl of fraction for 16-20 hours
in DMEM + 2% FBS. Cells are collected, washed with
phosphate buffered saline, and resuspended in 2 ml PBS.
200 μl of the 2 ml cell suspension is cytospun onto a
slide.

The cells were subjected to immunofluorescence for
dystrophin as follows. Cells were fixed in 100% MeOH at
-20°C. The cells were exposed to a monoclonal antibody
specific for the carboxy terminus of human dystrophin
[NCL-DYS2; Novocastra Laboratories Ltd., UK]. Cells were
then washed three times and exposed to a secondary
antibody, i.e. 1:200 goat anti-mouse IgG in FITC.

The titer/fraction for seven fractions revealed in
the immunofluorescent stains was calculated by the
following formula and reported in Table 2 below.
DFU/field = (DFU/200 μl cells) x 10 = DFU/10^4 cells =
(DFU/5 μl viral fraction) x 20 = DFU/100 μl fraction.

Table 2

Fraction    DFU/100 μl
1
2
3           6 x 10^3
4           1.8 x 10^4
5           9.6 x 10^3
6           200
7           200
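
The scaling behind the titer formula can be followed step by step: the slide carries 200 μl of a 2 ml cell suspension (one tenth of the well), and each well received 5 μl of a gradient fraction, so a further factor of 20 expresses the count per 100 μl of fraction. The sketch below illustrates that arithmetic; the function name and the example counts are hypothetical.

```python
def dfu_per_100ul(positive_cells_on_slide: int) -> int:
    """Scale a count of positive cells on one slide to DFU per 100 ul fraction."""
    slide_to_well = 10      # 200 ul of the 2 ml suspension was cytospun
    inoculum_to_100ul = 20  # each well received 5 ul of the fraction
    return positive_cells_on_slide * slide_to_well * inoculum_to_100ul


# Hypothetical counts: 30 positive cells on a slide, and a single positive cell.
example = dfu_per_100ul(30)
single_cell = dfu_per_100ul(1)
print(example, single_cell)
```

On this reading, a value of 200 DFU/100 μl corresponds to one positive cell on the slide, which is the detection floor of the assay as described.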


A virus capable of transducing the dystrophin
minigene is detected as a "positive" (i.e., green
fluorescent) cell. The results of the IF illustrate that
heat-treated fractions do not show positive
immunofluorescence. Southern blot data suggest one
species of the same size as the input DNA, with helper
virus contamination.

The recombinant virus can be subsequently separated
from the majority of helper virus by sedimentation
through cesium gradients. Initial studies demonstrate
that the functional AdCMVAmDys virions are produced, but
are contaminated with helper virus. Successful
purification would render AdA virions that are incapable
of encoding viral proteins but are capable of transducing
murine skeletal muscle.

Example 10 - Pseudotyping

The following experiment provides a method for
preparing a recombinant AdA according to the invention,
utilizing helper viruses from serotypes which differ from
that of the pAdA in the transfection/infection protocol.
It is unexpected that the ITRs and packaging sequence of
Ad5 could be incorporated into a virion of another
serotype.

A. Protocol 

The basic approach is to transfect the
AdA.CMVlacZ recombinant virus (Ad5) into 293 cells and
subsequently infect the cells with the helper virus
derived from a variety of Ad serotypes (2, 3, 4, 5, 7, 8,
12, and 40). When CPE is achieved, the lysate is
harvested and banded through two cesium gradients.

More particularly, the Ad5-based plasmid
pAdA.CMVlacZ of Example 1 was linearized with EcoRI. The
linearized plasmids were then transfected into ten 150 mm
dishes of 293 cells using calcium phosphate co-
precipitation. At 10-15 hours post transfection, wild
type adenoviruses (of one of the following serotypes: 2,
3, 4, 5, 7, 12, 40) were used to infect cells at an MOI
of 5. The cells were then harvested at full CPE and
lysed by three rounds of freeze-thawing. The pellet was
resuspended in 4 ml Tris-HCl. Cell debris was removed by
centrifugation and partial purification of Ad5A.CMVlacZ
from helper virus was achieved with 2 rounds of CsCl
gradient centrifugation (SW41 rotor, 35,000 rpm, 2
hours). Fractions were collected from the bottom of the
tube (fraction #1) and analysed for lacZ transducing
viruses on 293 target cells by histochemical staining (at
20 h post-infection). Contaminating helper viruses were
quantitated by plaque assay.

Except for adenovirus type 3, infection with Ad
serotypes 2, 4, 5, 7, 12 and 40 was able to produce lacZ
transducing viruses. The peak of β-galactosidase
activity was detected between the two major A260
absorbing peaks, where most of the helper viruses banded
(data not shown). The quantity of lacZ virus recovered
from 10 plates ranged from 10^4 to 10^8 transducing
particles depending on the serotype of the helper. As
expected, Ad2 and Ad5 produced the highest titer of lacZ
transducing viruses (Table 3). Wild type contamination
was in general 10^2 to 10^3 fold higher than the
corresponding lacZ titer except in the case of Ad40.
B. Results 

Table 3 summarizes the growth characteristics
of the wild type adenoviruses as evaluated on propagation
in 293 cells. This demonstrated the feasibility of
utilizing these helper viruses to infect the cell line
which has been transfected with the Ad5 deleted virus.




Table 3

Adenovirus serotype   p/ml          pfu/ml        p:pfu
2                     5 x 10^12     2.5 x 10^11   20:1
3                     1 x 10^12     6.25 x 10^9   160:1
4                     3 x 10^12     2 x 10^9      150:1
5                     1 x 10^12     5 x 10^10     20:1
7a                    5 x 10^12     1 x 10^11     50:1
12                    6 x 10^11     4 x 10^9      150:1
35                    1.2 x 10^12
40                    2.2 x 10^12   4.4 x 10^8    5000:1
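
The p:pfu column of Table 3 is the ratio of physical particles to plaque forming units in each stock. A minimal sketch of that calculation, applied to a few of the rows above, is given here; the function name is hypothetical and the sketch makes no claim about how the original values were rounded.

```python
def particle_to_pfu_ratio(particles_per_ml: float, pfu_per_ml: float) -> float:
    """Return the number of physical particles per plaque forming unit."""
    if pfu_per_ml <= 0:
        raise ValueError("pfu_per_ml must be positive")
    return particles_per_ml / pfu_per_ml


# (p/ml, pfu/ml) for selected serotypes, taken from Table 3.
stocks = {"Ad3": (1e12, 6.25e9), "Ad5": (1e12, 5e10), "Ad40": (2.2e12, 4.4e8)}
ratios = {name: round(particle_to_pfu_ratio(p, pfu)) for name, (p, pfu) in stocks.items()}
print(ratios)
```

A higher ratio indicates that a smaller share of the particles in the stock is infectious in 293 cells, as seen for Ad40.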


Table 4 summarizes the results of the final
purified fractions. The middle column, labeled LFU/ml,
quantifies the production of lacZ forming units, which is
a direct measure of the packaging and propagation of
pseudotyped recombinant AdA virus. The pfu/ml titer is
an estimate of the contaminating wild type virus. AdA
virus pseudotyped with all adenoviral strains was
generated except for Ad3. The titers range between 10^7
and 10^4.

Table 4

Serotype    LFU/ml        PFU/ml
2                         1.8 x 10^9
3           0             NA
4           6.7 x 10^6    9.3 x 10^7
5           6.3 x 10^7    1.9 x 10^9
7a          3 x 10^6      1.8 x 10^8
12          1.2 x 10^5    3.3 x 10^8
40          9.5 x 10^4    1.5 x 10^3


Table 5A-5D represents a more detailed analysis 
of the fractions from the second purification for each of 
the experiments summarized in Table 4. Again, LFU/Ml i* 
25 the recovery of the AdA viruses, whereas pfu/jil 
represents recovery of the helper virus. 




Table 

5A 


30 

Ad2 Fraction # 

VOLUME/ul 

LFU/Ul 

PFU/Ul 


1 

120 

9532 

8 X 10 6 


2 

100 

5.8 X 10 4 

3 X 10 6 

35 

3 

100 

8.24 X 10 4 

6 X 10 s 


4 

100 

9.47 X 10 4 

1.2 X 10 5 

40 

5 

100 

6 X 10 4 

8 X 10 4 


6 

100 

2 X 10 4 

6 X 10 4 


7 

100 

5434 

5 X 10 4 

45 

Total/ 10 pH 


3.32 X 10 7 

1.35 X 10 9 


50 
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Table 5B 



M4 Fraction / 

VOLUME/ul 

LFU/ul 

PFU/ul 

10 

1 

100 

1000 

1.75 X 10 5 



100 

1.79 x 10 4 

2.8 X 10 s 



100 

1.8 X 10 4 

5.5 x 10 4 

15 


100 

2909 

1.25 x 10 4 


5 

100 

920 

4 X 10 4 

20 


100 

153 

3 x 10 3 


Total/ 10 pH 


4 X 10 6 

5.6 x 10 7 

25 

MS Fraction # 





1 

120 

1.98 x 10 4 

6 X 10 s 

30 

2 

100 

5.8 x 10 4 

3 X 10 6 


3 

100 

1.2 X 10 s 

1.5 x 10 6 


4 

100 

1 X 10 s 

1.4 X 10 s 

35 

5 

100 

7.96 X 10 4 

8 X 10 4 


6 

100 

6860 

6 X 10 4 

40 

Total/10 pH 


3.88 x 10 7 

1.2 X 10 9 
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Table 5C 


5 

Ad7 Fraction #   Volume/µl   LFU/µl         PFU/µl
1                100         1225           5 x 10^5
2                100         5550           4 x 10^5
3                100         4938           2 x 10^5
4                100         3866           8 x 10^4
5                100         4134           6 x 10^4
6                100         995            7 x 10^4
7                100         230            6 x 10^3
Total/10 pH                  2.09 x 10^6    1.3 x 10^8

Ad12 Fraction #  Volume/µl   LFU/µl         PFU/µl
1                100         31             5 x 10^5
2                80          169            8.5 x 10^5
3                80          245            1.8 x 10^5
4                110         161            1.1 x 10^5
5                120         62             7 x 10^3
Total/10 pH                  6.14 x 10^4    1.65 x 10^8

Table 5D

Ad40 Fraction #  Volume/µl   LFU/µl         PFU/µl
1                80          61
2                80          184
3                80          199
4                80          168
5                80          122
6                100         46
7                100         32
Total/10 pH                  6.65 x 10^4    1.1 x 10^3

C. Characterization of the Structure of Packaged
Viruses

Aliquots of serial fractions were analysed by
Southern blots using lacZ as a probe. In the case of Ad2
and Ad5, not only the linearized monomer was packaged but
multiple forms of recombinant virus with distinct sizes
were found. These forms correlated well with the sizes
of dimers, trimers and other higher molecular weight
concatamers. The linearized monomers peaked closer to
the top of the tube (the defective adenovirus band) than
other forms. When these forms were correlated with lacZ
activity, a better correlation was found with the
higher molecular weight forms than with the monomers. With
pseudotyping of Ad4 and Ad7, no linearized monomers were
packaged and only higher molecular weight forms were
found.

These data definitively demonstrate the
production and characterization of the Δ virus and the
different pseudotypes. This example illustrates a very
simple way of generating pseudotype viruses.

Example - Vector Containing a FH Gene

Familial hypercholesterolemia (FH) is an autosomal
dominant disorder caused by abnormalities (deficiencies)
in the function or expression of LDL receptors [M.S.
Brown and J.L. Goldstein, Science, 232(4746):34-47
(1986); J.L. Goldstein and M.S. Brown, "Familial
hypercholesterolemia" in The Metabolic Basis of Inherited
Disease, ed. C.R. Scriver et al, McGraw Hill, New York,
pp. 1215-1250 (1989)]. Patients who inherit one abnormal
allele have moderate elevations in plasma LDL and suffer
premature life-threatening coronary artery disease (CAD).
Homozygous patients have severe hypercholesterolemia and
life-threatening CAD in childhood. An FH-containing
vector of the invention is constructed by replacing the
lacZ minigene in the pAdAc.CMVlacZ vector with a minigene
containing the LDL receptor gene [T. Yamamoto et al,
Cell, 39:27-38 (1984)] using known techniques and as
described analogously for the dystrophin gene and CFTR in
the preceding examples. Vectors bearing the LDL receptor
gene can be readily constructed according to this
invention. The resulting plasmid is termed
pAdAc.CMV-LDL.

This plasmid is useful in gene therapy of FH alone,
or preferably, in the form of a conjugate prepared as
described herein to substitute a normal LDL gene for the
abnormal allele responsible for the disease.

A. Ex Vivo Gene Therapy

Ex vivo gene therapy can be performed by
harvesting and establishing a primary culture of
hepatocytes from a patient. Known techniques may be used
to isolate and transduce the hepatocytes with the above
vector(s) bearing the LDL receptor gene(s). For example,
techniques of collagenase perfusion developed for rabbit
liver can be adapted for human tissue and used in
transduction. Following transduction, the hepatocytes
are removed from the tissue culture plates and reinfused
into the patient using known techniques, e.g. via a
catheter placed into the inferior mesenteric vein.
B. In Vivo Gene Therapy

Desirably, the in vivo approach to gene
therapy, e.g. liver-directed, involves the use of the
vectors and vector conjugates described above. A
preferred treatment involves infusing a vector LDL
conjugate of this invention into the peripheral
circulation of the patient. The patient is then
evaluated for change in serum lipids and liver tissues.

The virus or conjugate can be used to infect
hepatocytes in vivo by direct injection into a peripheral
or portal vein (10^7-10^8 pfu/kg) or retrograde into the
biliary tract (same dose). This effects gene transfer
into the majority of hepatocytes.

Treatments are repeated as necessary, e.g.
weekly. Administration of a dose of virus equivalent to
an MOI of approximately 20 (i.e. 20 pfu/hepatocyte) is
anticipated to lead to high level gene expression in the
majority of hepatocytes.
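
The two dosing figures given above, a per-kilogram dose of 10^7-10^8 pfu/kg and an MOI of approximately 20 pfu/hepatocyte, each reduce to a single multiplication. The Python sketch below is illustrative only: the function names are hypothetical, and the 70 kg body weight and the count of 10^9 target hepatocytes are placeholder inputs, not values taken from this specification.

```python
# Simple dose arithmetic for the two dosing descriptions in the text.
# All numeric inputs in the usage lines are hypothetical placeholders.

def dose_from_weight(pfu_per_kg: float, body_weight_kg: float) -> float:
    """Total dose (pfu) for a per-kilogram dose and a body weight."""
    return pfu_per_kg * body_weight_kg

def dose_from_moi(moi: float, target_cells: float) -> float:
    """Total dose (pfu) needed to reach a given MOI on a number of cells."""
    return moi * target_cells

def moi_from_dose(dose_pfu: float, target_cells: float) -> float:
    """MOI (pfu per cell) achieved by a dose on a number of cells."""
    return dose_pfu / target_cells

low = dose_from_weight(1e7, 70)    # lower end of the stated range
high = dose_from_weight(1e8, 70)   # upper end of the stated range
needed = dose_from_moi(20, 1e9)    # MOI of 20 on 1e9 cells (placeholder)

print(f"dose range for 70 kg: {low:.1e} to {high:.1e} pfu")
print(f"dose for MOI 20 on 1e9 cells: {needed:.1e} pfu")
```

Keeping the two calculations separate makes it easy to see which assumed input (body weight or target cell number) drives the resulting dose.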

All references recited above are incorporated herein
by reference. Numerous modifications and variations of
the present invention are included in the above-identified
specification and are expected to be obvious
to one of skill in the art. Such modifications and
alterations to the compositions and processes of the
present invention, such as various modifications to the
PAC sequences or the shuttle vectors, or to other
sequences of the vector, helper virus and minigene
components, are believed to be encompassed in the scope
of the claims appended hereto.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Trustees of the University of Pennsylvania
Wilson, James M.
Fisher, Krishna J.
Chen, Shu-Jen
Weitzman, Matthew

(ii) TITLE OF INVENTION: Improved Adenovirus and Methods
of Use Thereof

(iii) NUMBER OF SEQUENCES: 10

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Howson and Howson
(B) STREET: Spring House Corporate Cntr, PO Box 457
(C) CITY: Spring House
(D) STATE: Pennsylvania
(E) COUNTRY: USA
(F) ZIP: 19477

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 08/331,381
(B) FILING DATE: 28-OCT-1994

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Bak, Mary B.
(B) REGISTRATION NUMBER: 31,215
(C) REFERENCE/DOCKET NUMBER: GNVPN.008PCT

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 215-540-9200
(B) TELEFAX: 215-540-5818

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7897 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:


GAACTCGAGC AGCTGAAGCT TGAATTCCAT CATCAATAAT ATACCTTATT 50
TTGGATTGAA GCCAATATGA TAATGAGGGG GTGGAGTTTG TGACGTGGCG 100
CGGGGCGTGG GAACGGGGCG GGTGACGTAG GTTTTAGGGC GGAGTAACTT 150
GTATGTGTTG GGAATTGTAG TTTTCTTAAA ATGGGAAGTT ACGTAACGTG 200
GGAAAACGGA AGTGACGATT TGAGGAAGTT GTGGGTTTTT TGGCTTTCGT 250
TTCTGGGCGT AGGTTCGCGT                       TGTGGACTTT 300
AACCGTTACG TCATTTTTTA GTCCTATATA TACTCGCTCT GCACTTGGCC 350
CTTTTTTACA CTGTGACTGA TTGAGCTGGT GCCGTGTCGA GTGGTGTTTT 400
TTTAATAGGT TTTCTTTTTT ACTGGTAAGG CTGACTGTTA GGCTGCCGCT 450
GTGAAGCGCT GTATGTTGTT CTGGAGCGGG AGGGTGCTAT TTTGCCTAGG 500
CAGGAGGGTT TTTCAGGTGT TTATGTGTTT TTCTCTCCTA TTAATTTTGT 550
TATACCTCCT ATGGGGGCTG TAATGTTGTC TCTACGCCTG CGGGTATGTA 600
TTCCCCCCAA GCTTGCATGC CTGCAGGTCG ACTCTAGAGG ATCCGAAAAA 650
ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA TGCAATTGTT 700
GTTGTTAACT TGTTTATTGC AGCTTATAAT GGTTACAAAT AAAGCAATAG 750
CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGTG 800
GTTTGTCCAA ACTCATCAAT GTATCTTATC ATGTCTGGAT CCCCGCGGCC 850
GCCTAGAGTC GAGGCCGAGT TTGTCAGAAA GCAGACCAAA CAGCGGTTGG 900
AATAATAGCG AGAACAGAGA AATAGCGGCA AAAATAATAC CCGTATCACT 950
TTTGCTGATA TGGTTGATGT CATGTAGCCA AATCGGGAAA AACGGGAAGT 1000
AGGCTCCCAT GATAAAAAAG TAAAAGAAAA AGAATAAACC GAACATCCAA 1050

AAGTTTGTGT TTTTTAAATA GTACATAATG GATTTCCTTA CGCGAAATAC 1100 

GGGCAGACAT GGCCTGCCCG GTTATTATTA TTTTTGACAC CAGACCAACT 1150 

GGTAATGGTA GCGACCGGCG CTCAGCTGTA AJTCCGCCGA TACTGACGGG 1200 

CTCCAGGAGT CGTCGCCACC AATCCCCATA TGGAAACCGT CGATATTCAG 1250 

CCATGTGCCT TCTTCCGCGT GCAGCAGATG GCGATGGCTG CTTTCCATCA 1300 

GTTGCTGTTG ACTGTAGCGG CTGATGTTGA ACTGGAAGTC GCCGCGCCAC 1350 

TGGTGTGGGC CATAATTCAA TTCGCGCGTC CCGCAGCGCA GACCGTTTTC 1400 

GCTCGGGAAG ACGTACGGGG TATACATGTC TGACAATGGC AGATCCCAGC 1450 

GGTCAAAACA GGCGGCAGTA AGGCGGTCGG GATAGTTTTC TTGCGGCCCT 1500 

AATCCGAGCC AGTTTACCCG CTCTGCTACC TGCGCCAGCT GGCAGTTCAG 1550 

GCCAATCCGC GCCGGATGCG GTGTATCGCT CGCCACTTCA ACATCAACGG 1600 

TAATCGCCAT TTGACCACTA CCATCAATCC GGTAGGTTTT CCGGCTGATA 1650 

AATAAGGTTT TCCCCTGATG CTGCCACGCG TGAGCGGTCG TAATCAGCAC 1700 
CGCATCAGCA AGTGTATCTG CCGTGCACTG CAACAACGCT GCTTCGGCCT 1750
GGTAATGGCC CGCCGCCTTC CAGCGTTCGA CCCAGGCGTT AGGGTCAATG 1800
CGGGTCGCTT CACTTACGCC AATGTCGTTA TCCAGCGGTG CACGGGTGAA 1850
CTGATCGCGC AGCGGCGTCA GCAGTTGTTT TTTATCGCCA ATCCACATCT 1900
GTGAAAGAAA GCCTGACTGG CGGTTAAATT GCCAACGCTT ATTACCCAGC 1950
TCGATGCAAA AATCCATTTC GCTGGTGGTC AGATGCGGGA TGGCGTGGGA 2000
CGCGGCGGGG AGCGTCACAC TGAGGTTTTC CGCCAGACGC CACTGCTGCC 2050
AGGCGCTGAT GTGCCCGGCT TCTGACCATG CGGTCGCGTT CGGTTGCACT 2100
ACGCGTACTG TGAGCCAGAG TTGCCCGGCG CTCTCCGGCT GCGGTAGTTC 2150
AGGCAGTTCA ATCAACTGTT TACCTTGTGG AGCGACATCC AGAGGCACTT 2200
CACCGCTTGC CAGCGGCTTA CCATCCAGCG CCACCATCCA GTGCAGGAGC 2250
TCGTTATCGC TATGACGGAA CAGGTATTCG CTGGTCACTT CGATGGTTTC 2300
CCCGGATAAA CGGAACTGGA AAAACTGCTG CTGGTGTTTT GCTTCCGTCA 2350
GCGCTGGATG CGGCGTGC6G TCGGCAAAGA CCAGACCGTT CATACAGAAC 2400 
TGGCGATCGT TCGGCGTATC GCCAAAATCA CCGCCGTAAG CCGACCACGG 2450 
GTTGCCGTTT TCATCATATT TAATCAGCGA CTJATCCACC CAGTCCCAGA 2500 
CGAAGCCGCC CTGTAAACGG GGATACTGAC GAAACGCCTG CCAGTATOTA 2550 
GCGAAACCGC CAAGACTGTT ACCCATCGCG TGGGCGTATT CGCAAAGGAT 2600 
CAGCGGGCGC GTCTCTCCAG GTAGCGAAAG CCATTTTTTG ATGGACCATT 2650 

TCGGCACAGC CGGGAAGGGC TGGTCTTCAT CCACGCGCGC GTACATCGGG 2700 

CAAATAATAT CGGTGGCCGT GGTGTCGGCT CCGCCGCCTT CATACTGCAC 2750 

CGGGCGGGAA GGATCGACAG ATTTGATCCA GCGATACAGC GCGTCGTGAT 2800 

TAGCGCCGTG GCCTGATTCA TTCCCCAGCG ACCAGATGAT CACACTCGGG 2850 

TGATTACGAT CGCGCTGCAC CATTCGCGTT ACGCGTTCGC TCATCGCCGG 2900 

TAGCCAGCGC GGATCATCGG TCAGACGATT CATTGGCACC ATGCCGTGGG 2950 

TTTCAATATT GGCTTCATCC ACCACATACA GGCCGTAGCG GTCGCACAGC 3000 

GTGTACCACA GCGGATGGTT CGGATAATGC GAACAGCGCA CGGCGTTAAA 3050 

GTTGTTCTGC TTCATCAGCA GGATATCCTG CACCATCGTC TGCTCATCCA 3100 

TGACCTGACC ATGCAGAGGA TGATGCTCGT GACGGTTAAC GCCTCGAATC 3150 

AGCAACGGCT TGCCGTTCAG CAGCAGCAGA CCATTTTCAA TCCGCACCTC 3200 

GCGGAAACCG ACATCGCAGG CTTCTGCTTC AATCAGCGTG CCGTCGGCGG 3250 

TGTGCAGTTC AACCACCGCA CGATAGAGAT TCGGGATTTC GGCGCTCCAC 3300 

AGTTTCGGGT TTTCGACGTT CAGACGTAGT GTGACGCGAT CGGCATAACC 3350 

ACCACGCTCA TCGATAATTT CACCGCCGAA AGGCGCGGTG CCGCTGGCGA 3400 

CCTGCGTTTC ACCCTGCCAT AAAGAAACTG TTACCCGTAG GTAGTCACGC 3450 

AACTCGCCGC ACATCTGAAC TTCAGCCTCC AGTACAGCGC GGCTGAAATC 3500 

ATCATTAAAG CGAGTGGCAA CATGGAAATC GCTGATTTGT GTAGTOGGTT 3550 

TATGCAGCAA CGAGACGTCA CGGAAAATGC CGCTCATCCG CCACATATCC 3600
TGATCTTCCA GATAACTGCC GTCACTCCAA CGCAGCACCA TCACCGCGAG 3650
GCGGTTTTCT CCGGCGCGTA AAAATGCGCT CAGGTCAAAT TCAGACGGCA 3700
AACGACTGTC CTGGCCGTAA CCGACCCAGC GCCCGTTCCA CCACAGATGA 3750
AACGCCGAGT TAACGCCATC AAAAATAATT CGCGTCTGGC CTTCCTGTAG 3800
CCAGCTTTCA TCAACATTAA ATGTGAGCGA GTAACAACCC GTCGGATTCT 3850
CCGTGGGAAC AAACGGCGGA TTGACCGTAA TGGGATAGGT TACGTTGGTG 3900
TAGATGGGCG CATCGTAACC GTGCATCTGC CAGTTTGAGG GGACGACGAC 3950
AGTATCGGCC TCAGGAAGAT CGCACTCCAG CCAGCTTTCC GGCACCGCTT 4000
CTGGTGCCGG AAACCAGGCA AAGCGCCATT CGCCATTCAG GCTGCGCAAC 4050
TGTTGGGAAG GGCGATCGGT GCGGGCCTCT TCGCTATTAC GCCAGCTGGC 4100
CAAAGGGGGA TGTGCTGCAA GGCGATTAAG TTGGGTAACG CCAGGGTTTT 4150
CCCAGTCACG ACGTTGTAAA ACGACGGGAT CGCGCTTGAG CAGCTCCTTG 4200
CTGGTGTCCA GACCAATGCC TCCCAGACCG GCAACGAAAA TCACGTTCTT 4250
GTTGGTCAAA GTAAACGACA TGGTGACTTC TTTTTTGCTT TAGCAGGCTC 4300
TTTCGATCCC CGGGAATTGC GGCCGCGGGT ACAATTCCGC AGCTTTTAGA 4350
GCAGAAGTAA CACTTCCGTA CAGGCCTAGA AGTAAAGGCA ACATCCACTG 4400
AGGAGCAGTT CTTTGATTTG CACCACCACC GGATCCGGGA CCTGAAATAA 4450
AAGACAAAAA GACTAAACTT ACCAGTTAAC TTTCTGGTTT TTCAGTTCCT 4500
CGAGTACCGG ATCCTCTAGA GTCCGGAGGC TCGATCGGTC CCGGTCTCTT 4550
CTATCGAGGT CAAAACAGCG TGGATGGCGT CTCCAGGCGA TCTGACGGTT 4600
CACTAAACGA GCTCTGCTTA TATAGACCTC CCACCGTACA CGCCTACCGC 4650
CCATTTGCGT CAATGGGGCG GAGTTGTTAC GACATTTTGG AAAGTCCCGT 4700
TGATTTTGGT GCCAAAACAA ACTCCCATTG ACGTCAATGG GGTGGAGACT 4750
TGGAAATCCC CGTGAGTCAA ACCGCTATCC ACGCCCATTG ATGTACTGCC 4800
AAAACCGCAT CACCATGGTA ATAGCGATGA CTAATACGTA GATGTACTGC 4850
CAAGTAGGAA AGTCCCATAA GGTCATGTAC TGGGCATAAT GCCAGGCGGG 4900
CCATTTACCG TCATTGACGT CAATAGGGGG CGTACTTGGC ATATGATACA 4950

CTTGATGTAC TGCCAAGTGG GCAGTTTACC GTAAATACTC CACCCATTGA 5000 

CGTCAATGGA AAGTCCCTAT TGGCGTTACT ATGGGAACAT ACGTCATTAT 5050 

TGACGTCAAT GGGCGGGGGT CGTTGGGCGG TO.GCCAGGC GGGCCATTTA 5100 

CCGTAAGTTA TGTAACGACC TGCAGGTCGA CTCTAGAGGA TCTCCCTAGA 5150 

CAAATATTAC GCGCTATGAG TAACACAAAA TTATTCAGAT TTCACTTCCT 5200 

CTTATTCAGT TTTCCCGCGA AAATGGCCAA ATCTTACTCG GTTACGCCCA 5250 

AATTTACTAC AACATCCGCC TAAAACCGCG CGAAAATTGT CACTTCCTGT 5300 

GTACACCGGC GCACACCAAA AACGTCACTT TTGCCACATC CGTCGCTTAC 5350 

ATGTGTTCCG CCACACTTGC AACATCACAC TTCCGCCACA CTACTACGTC 5400 

ACCCGCCCCG TTCCCACGCC CCGCGCCACG TCACAAACTC CACCCCCTCA 5450 

TTATCATATT GGCTTCAATC CAAAATAAGG TATATTATTG ATGATGCTAG 5500 

CGAATTCATC GATGATATCA GATCTGCCGG TCTCCCTATA GTGAGTCGTA 5550 

TTAATTTCGA TAAGCCAGGT TAACCTGCAT TAATGAATCG GCCAACGCGC 5600 

GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC TCGCTCACTG 5650 

ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA 5700 

AAGGCGGTAA TACGGTXATC CACAGAATCA GGGGATAACG CAGGAAAGAA 5750 

CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT 5800 

TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT 5850 

CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA 5900 

GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT CCGACCCTGC 5950 

CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT 6000 

TCTCAATGCT CAOGCTGTAG GTATCTCAGT TCGGTGTAGG TCGTTCGCTC 6050 

CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT 6100 

TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG 6150 

CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG 6200
CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC TACACTAGAA 6250
GGACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA 6300
AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG CTAGCGGTGG 6350
TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG 6400
AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC 6450
TCACGTTAAG GGATTTTGGT CATGAGATTA TCAAAAAGGA TCTTCACCTA 6500
GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA AGTATATATG 6550
AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC 6600
TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT 6650
GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 6700
TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAC 6750
CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC 6800
CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC 6850
CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG 6900
TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC 6950
AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 7000
TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC 7050
ATGGTTATGG CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG 7100
ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC TGAGAATAGT 7150
GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC 7200
GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC 7250
GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT 7300
AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC 7350
GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT 7400
AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT TTTCAATATT 7450
ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA 7500
TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA 7550

AGTGCCACCT GACGTCTAAG AAACCATTAT TATCATGACA TTAACCTATA 7600 

AAAATAGGCG TATCACGAGG CCCTTTCGTC TCGCGCGTTT CGGTGATGAC 7650 

GGTGAAAACC TCTGACACAT GCAGCTCCCG GAJACGGTCA CAGCTTGTCT 7700 

GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG TCAGCGGGTG 7750 

TTGGCGGGTG TCGGGGCTGG CTTAACTATG CGGCATGAGA GCAGATTGTA 7800 

CTGAGAGTGC ACCATATGGA CATATTGTCG TTAGAACGCG GCTACAATTA 7850 

ATACATAACC TTATGTATCA TACACATACG ATTTAGGTGA CACTATA 7897 
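
Because each row of the listing ends with the running position of its last base, a scanned copy can be checked mechanically. The Python sketch below is a hypothetical helper, not part of the specification: the name `check_listing` and the row format it assumes (space-separated blocks followed by a position number) are assumptions based on the layout above. It reports rows whose cumulative length disagrees with the printed position, and rows containing characters other than A, C, G and T.

```python
import re

def check_listing(rows):
    """Return (row_index, message) pairs for problems found in listing rows.

    Each row is expected to look like 'BLOCK BLOCK ... N', where N is the
    1-based position of the last base in that row.
    """
    problems = []
    running = 0  # number of bases seen so far
    for i, row in enumerate(rows):
        parts = row.split()
        if not parts or not parts[-1].isdigit():
            problems.append((i, "missing position number"))
            continue
        position = int(parts[-1])
        bases = "".join(parts[:-1])
        bad = sorted(set(re.findall(r"[^ACGT]", bases)))
        if bad:
            problems.append((i, "unexpected characters: " + "".join(bad)))
        running += len(bases)
        if running != position:
            problems.append(
                (i, f"expected position {running}, printed {position}")
            )
            running = position  # resynchronise on the printed value
    return problems

# Small made-up example: the third row contains a digit inside a block.
example_rows = [
    "GAACTCGAGC AGCTGAAGCT TGAATTCCAT CATCAATAAT ATACCTTATT 50",
    "TTGGATTGAA GCCAATATGA TAATGAGGGG GTGGAGTTTG TGACGTGGCG 100",
    "A6AATAAACC GAACATCCAA 120",
]
print(check_listing(example_rows))
```

Resynchronising on the printed position after a mismatch keeps one damaged row from flagging every row that follows it.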

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7852 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

GAATTCGCTA GCTAGCGGGG GAATACATAC CCGCAGGCGT AGAGACAACA 50 

TTACAGCCCC CATAGGAGGT ATAACAAAAT TAATAGGAGA GAAAAACACA 100 

TAAACACCTG AAAAACCCTC CTGCCTAGGC AAAATAGCAC CCTCCCGCTC 150 

CAGAACAACA TACAGCGCTT CACAGCGGCA GCCTAACAGT CAGCCTTACC 200 

AGTAAAAAAG AAAACCTATT AAAAAAACAC CACTCGACAC GGCACCAGCT 250 

CAATCAGTCA CAGTGTAAAA AAGGGCCAAG TGCAGAGCGA GTATATATAG 300 

GACTAAAAAA TGACGTAACG GTTAAAGTCC ACAAAAAACA CCCAGAAAAC 350 

CGCACGCGAA CCTACGCCCA GAAACGAAAG CCAAAAAACC CACAACTTCC 400 

TCAAATCGTC ACTTCCGTTT TCCCACGTTA CGTAACTTCC CATTTTAAGA 450 

AAACTACAAT TCCCAACACA TACAAGTTAC TCCGCCCTAA AACCTACGTC 500 

ACCCGCCCCG TTCCCACGCC CCGCGCCACG TCACAAACTC CACCCCCTCA 550
TTATCATATT GGCTTCAATC CAAAATAAGG TATATTATTG ATGATGCTAG 600
CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 650
GGGGTGGAGT TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 700
TAGTAGTGTG GCGGAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 750
GCGACGGATG TGGCAAAAGT GACGTTTTTG GTGTGCGCCG GTGTACACAG 800
GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 850
CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 900
AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 950
GGGAGATCAG CCTGCAGGTC GTTACATAAC TTACGGTAAA TGGCCCGCCT 1000
GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT 1050
TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT 1100
ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 1150
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA 1200
TGCCCAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG 1250
TATTAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA GTACATCAAT 1300
GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC TCCACCCCAT 1350
TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA 1400
AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 1450
CGGTGGGAGG TCTATATAAG CAGAGCTCGT TTAGTCAACC GTCAGATCGC 1500
CTGGAGACGC CATCCACGCT GTTTTGACCT CCATAGAAGA CACCGGGACC 1550
GATCCAGCCT CCGGACTCTA GAGGATCCGG TACTCGAGGA ACTGAAAAAC 1600
CAGAAAGTTA ACTGGTAAGT TTAGTCTTTT TGTCTTTTAT TTCAGGTCCC 1650
GGATCCGGTC GTGGTGCAAA TCAAAGAACT GCTCCTCAGT GGATGTTGCC 1700
TTTACTTCTA GGCCTGTACG GAAGTGTTAC TTCTGCTCTA AAAGCTGCGG 1750
AATTGTACCC GCGGCCGCAA TTCCCGGGGA TCGAAAGAGC CTGCTAAAGC 1800
AAAAAAGAAG TCACCATGTC GTTTACTTTG ACCAACAAGA ACGTGATTTT 1850
CGTTGCCGGT CTGGGAGGCA TTGGTCTGGA CACCAGCAAG GAGCTGCTCA 1900
AGCGCGATCC CGTCGTTTTA CAACGTCGTG ACTGGGAAAA CCCTGGCGTT 1950
ACCCAACTTA ATCGCCTTGC AGCACATCCC CCTTTCGCCA GCTGGCGTAA 2000
TAGCGAAGAG GCCCGCACCG ATCGCCCTTC CCAACAGTTG CGCAGCCTGA 2050
ATGGCGAATG GCGCTTTGCC TGGTTTCCGG CACCAGAAGC GGTGCCGGAA 2100
AGCTGGCTGG AGTGCGATCT TCCTGAGGCC GATACTGTCG TCGTCCCCTC 2150
AAACTGGCAG ATGCACGGTT ACGATGCGCC CATCTACACC AACGTAACCT 2200
ATCCCATTAC GGTCAATCCG CCGTTTGTTC CCACGGAGAA TCCGACGGGT 2250
TGTTACTCGC TCACATTTAA TGTTGATGAA AGCTGGCTAC AGGAAGGCCA 2300
GACGCGAATT ATTTTTGATG GCGTTAACTC GGCGTTTCAT CTCTGGTGCA 2350
ACGGGCGCTG GGTCGGTTAC GGCCAGGAGA GTCGTTTGCC GTCTGAATTT 2400
GACCTGAGCG CATTTTTACG CGCCGGAGAA AACCGCCTCG CGGTGATGGT 2450
GCTGCGTTGG AGTGACGGCA GTTATCTGGA AGATCAGGAT ATGTGGCGGA 2500
TGAGCGGCAT TTTCCGTGAC            TGCATAAACC GACTACACAA 2550
ATCAGCGATT TCCATGTTGC CACTCGCTTT AATGATGATT TCAGCCGCGC 2600
TGTACTGGAG GCTGAAGTTC AGATGTGCGG CGAGTTGCGT GACTACCTAC 2650
GGGTAACAGT TTCTTTATGG CAGGGTGAAA CGCAGGTCGC CAGCGGCACC 2700
GCGCCTTTCG GCGGTGAAAT TATCGATGAG CGTGGTGGTT ATGCCGATCG 2750
CGTCACACTA CGTCTGAACG TCGAAAACCC GAAACTGTGG AGCGCCGAAA 2800
TCCCGAATCT CTATCGTGCG GTGGTTGAAC TGCACACCGC CGACGGCACG 2850
CTGATTGAAG CAGAAGCCTG CGATGTCGGT TTCCGCGAGG TGCGGATTGA 2900
AAATGGTCTG CTGCTGCTGA ACGGCAAGCC GTTGCTGATT CGAGGCGTTA 2950
ACCGTCACGA GCATCATCCT CTGCATGGTC AGGTCATGGA TGAGCAGACC 3000
ATGGTGCAGG ATATCCTGCT GATGAAGCAG AACAACTTTA ACGCCGTGCG 3050
CTGTTCGCAT TATCCGAACC ATCCGCTGTG GTACACGCTG TGCGACCGCT 3100
ACGGCCTGTA TGTGGTGGAT GAAGCCAATA TTGAAACCCA CGGCATGGTG 3150
CCAATGAATC GTCTGACCGA TGATCCGCGC TGGCTACCGG CGATGAGCGA 3200

ACGCGTAACG CGAATGGTGC AGCGCGATCG TAATCACCCG AGTGTGATCA 3250 

TCTGCTCGCT GGGGAATGAA TCAGGCCACG GCGCTAATCA CGACGCGCTG 3300 

TATCGCTGGA TCAAATCTGT CGATCCTTCC O-CCCGGTCC AGTATGAAGG 3350 

CGGCGGAGCC GACACCACGG CCACCGATAT TATTTGCCCG ATGTACGCGC 3400 

GCGTGGATGA AGACCAGCCC TTCCCGGCTG TGCCGAAATG GTCCATCAAA 3450 

AAATGGCTTT CGCTACCTGG AGAGACGCGC CCGCTGATCC TTTGCGAATA 3500 

CGCCCACGCG ATGGGTAACA GTCTTGGCGG TTTCGCTAAA TACTGGCAGG 3550 

CGTTTCGTCA GTATCCCCGT TTACAGGGCG GCTTCGTCTG GGACTGGGTG 3600 

GATCAGTCGC TGATTAAATA TGATGAAAAC GGCAACCCGT GGTCGGCTTA 3650 

CGGCGGTGAT TTTGGCGATA CGCCGAACGA TCGCCAGTTC TGTATGAACG 3700 

GTCTGGTCTT TGCCGACCGC ACGCCGCATC CAGCGCTGAC GGAAGCAAAA 3750 

CACCAGCAGC AGTTTTTCCA GTTCCGTTTA TCCGGGCAAA CCATCGAAGT 3800 

GACCAGCGAA TACCTGTTCC GTCATAGCGA TAACGAGCTC CTGCACTGGA 3850 

TGGTGGCGCT GGATGGTAAG CCGCTGGCAA GCGGTGAAGT GCCTCTGGAT 3900 

GTCGCTCCAC AAGGTAAACA GTTGATTGAA CTGCCTGAAC TACCGCAGCC 3950 

GGAGAGCGCC GGGCAACTCT GGCTCACAGT ACGCGTAGTG CAACCGAACG 4000 

CGACCGCATG GTCAGAAGCC GGGCACATCA GCGCCTGGCA GCAGTGGCGT 4050 
CTGGCGGAAA ACCTCAGTGT GACGCTCCCC GCCGCGTCCC ACGCCATCCC 4100
GCATCTGACC ACCAGCGAAA TGGATTTTTG CATCGAGCTG GGTAATAAGC 4150
GTTGGCAATT TAACCGCCAG TCAGGCTTTC TTTCACAGAT GTGGATTGGC 4200
GATAAAAAAC AACTGCTGAC GCCGCTGCGC GATCAGTTCA CCCGTGCACC 4250
GCTGGATAAC GACATTGGCG TAAGTGAAGC GACCCGCATT GACCCTAACG 4300
CCTGGGTCGA ACGCTGGAAG GCGGCGGGCC ATTACCAGGC CGAAGCAGCG 4350
TTGTTGCAGT GCACGGCAGA TACACTTGCT GATGCGGTGC TGATTACGAC 4400
CGCTCACGCG TGGCAGCATC AGGGGAAAAC CTTATTTATC AGCCGGAAAA 4450
CCTACCGGAT TGATGGTAGT GGTCAAATGG CGATTACCGT TGATGTTGAA 4500
GTGGCGAGCG ATACACCGCA TCCGGCGCGG ATTGGCCTGA ACTGCCAGCT 4550
GGCGCAGGTA GCAGAGCGGG TAAACTGGCT CGGATTAGGG CCGCAAGAAA 4600
ACTATCCCGA CCGCCTTACT GCCGCCTGTT TTGACCGCTG GGATCTGCCA 4650
TTGTCAGACA TGTATACCCC GTACGTCTTC CCGAGCGAAA ACGGTCTGCG 4700
CTGCGGGACG CGCGAATTGA ATTATGGCCC ACACCAGTGG CGCGGCGACT 4750
TCCAGTTCAA CATCAGCCGC TACAGTCAAC AGCAACTGAT GGAAACCAGC 4800
CATCGCCATC TGCTGCACGC GGAAGAAGGC ACATGGCTGA ATATCGACGG 4850
TTTCCATATG GGGATTGGTG GCGACGACTC CTGGAGCCCG TCAGTATCGG 4900
CGGAATTACA GCTGAGCGCC GGTCGCTACC ATTACCAGTT GGTCTGGTGT 4950
CAAAAATAAT AATAACCGGG CAGGCCATGT CTGCCCGTAT TTCGCGTAAG 5000
GAAATCCATT ATGTACTATT TAAAAAACAC AAACTTTTGG ATGTTCGGTT 5050
TATTCTTTTT CTTTTACTTT TTTATCATGG GAGCCTACTT CCCGTTTTTC 5100
CCGATTTGGC TACATGACAT CAACCATATC AGCAAAAGTG ATACGGGTAT 5150
TATTTTTGCC GCTATTTCTC TGTTCTCGCT ATTATTCCAA CCGCTGTTTG 5200
GTCTGCTTTC TGACAAACTC GGCCTCGACT CTAGGCGGCC GCGGGGATCC 5250
AGACATGATA AGATACATTG ATGAGTTTGG ACAAACCACA ACTAGAATGC 5300
AGTGAAAAAA ATGGTTTATT TGTGAAATTT GTGATGCTAT TGCTTTATTT 5350
GTAACCATTA TAAGCTGCAA TAAACAAGTT AACAACAACA ATTGCATTCA 5400
TTTTATGTTT CAGGTTCAGG GGGAGGTGTG GGAGGTTTTT TCGGATCCTC 5450
TAGAGTCGAC GACGCGAGGC TGGATGGCCT TCCCCATTAT GATTCTTCTC 5500
GCTTCCGGCG GCATCGGGAT GCCCGCGTTG CAGGCCATGC TGTCCAGGCA 5550
GGTAGATGAC GACCATCAGG GACAGCTTCA AGGATCGCTC GCGGCTCTTA 5600
CCAGCCTAAC TTCGATCACT GGACCGCTGA TCGTCACGGC GATTTATGCC 5650
GCCTCGGCGA GCACATGGAA CGGGTTGGCA TGGATTGTAG GCGCCGCCCT 5700
ATACCTTGTC TGCCTCCCCG CGTTGCGTCG CGGTGCATGG AGCCGGGCCA 5750
CCTCGACCTG AATGGAAGCC GGCGGCACCT CGCTAACGGA TTCACCACTC 5800

CAAGAATTGG AGCCAATCAA TTCTTGCGGA GAACTGTGAA TGCGCAAACC 5850 

AACCCTTGGC AGAACATATC CATCGCGTCC GCCATCTCCA GCAGCCGCAC 5900 

GCGGCGCATC TCGGGCAGCG TTGGGTCCTG GJCACGGGT6 CGCATGATCG 5950 

TGCTCCTGTC GTTGAGGACC CGGCTAGGCT GGCGGGGTTG CCTTACTGGT 6000 

TAGCAGAATG AATCACCGAT ACGCGAGCGA ACGTGAAGCG ACTGCTGCTG 6050 

CAAAACGTCT GCGACCTGAG CAACAACATG AATGGTCTTC GGTTTCCGTG 6100 

TTTCGTAAAG TCTGGAAACG CGGAAGTCAG CGCCCTGCAC CATTATGTTC 6150 

CGGATCTGCA TCGCAGGATG CTGCTGGCTA CCCTGTGGAA CACCTACATC 6200 

TGTATTAACG AAGCCTTTCT CAATGCTCAC GCTGTAGGTA TCTCAGTTCG 6250 

GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA 6300 

GCCCGACCGC TGCGCCTTAT CCGGTAACTA TCGTCTTGAG TCCAACCCGG 6350 

TAAGACACGA CTTATCGCCA CTGGCAGCAG CCACTGGTAA CAGGATTAGC 6400 

AGAGCGAGGT ATGTAGGCGG TGCTACAGAG TTCTTGAAGT GGTGGCCTAA 6450 

CTACGGCTAC ACTAGAAGGA CAGTATTTGG TATCTGCGCT CTGCTGAAGC 6500 

CAGTTACCTT CGGAAAAAGA GTTGGTAGCT CTTGATCCGG CAAACAAACC 6550 

ACCGCTGGTA GCGGTGGTTT TTTTGTTTGC AAGCAGCAGA TTACGCGCAG 6600 

AAAAAAAGGA TCTCAAGAAG ATCCTTTGAT CTTTTCTACG GGGTCTGACG 6650 

CTCAGTGGAA CGAAAACTCA CGTTAAGGGA TTTTGGTCAT GAGATTATCA 6700 

AAAAGGATCT TCACCTAGAT CCTTTTAAAT TAAAAATGAA GTTTTAAATC 6750 

AATCTAAAGT ATATATGAGT AAACTTGGTC TGACAGTTAC CAATGCTTAA 6800 

TCAGTGAGGC ACCTATCTCA GCGATCTGTC TATTTCGTTC ATCCATAGTT 6850 

GCCTGACTCC CCGTCGTGTA GATAACTACG ATACGGGAGG GCTTACCATC 6900 

TGGCCCCAGT GCTGCAATGA TACCGCGAGA CCCACGCTCA CCGGCTCCAG 6950 

ATTTATCAGC AATAAACCAG CCAGCCGGAA GGGCCGAGCG CAGAAGTGGT 7000 

CCTGCAACTT TATCCGCCTC CATCCAGTCT ATTAATTGTT GCCGGGAAGC 7050
TAGAGTAAGT AGTTCGCCAG TTAATAGTTT GCGCAACGTT GTTGCCATTG 7100
CTGCAGGCAT CGTGGTGTCA CGCTCGTCGT TTGGTATGGC TTCATTCAGC 7150
TCCGGTTCCC AACGATCAAG GCGAGTTACA TCATCCCCCA TGTTGTGCAA 7200
AAAAGCGGTT AGCTCCTTCG GTCCTCCGAT CGTTGTCAGA AGTAAGTTGG 7250
CCGCAGTGTT ATCACTCATG GTTATGCCAG CACTGCATAA TTCTCTTACT 7300
GTCATGCCAT CCGTAAGATG CTTTTCTGTG ACTGGTGAGT ACTCAACCAA 7350
GTCATTCTGA GAATAGTGTA TGCGGCGACC GAGTTGCTCT TGCCCGGCGT 7400
CAACACGGGA TAATACCGCG CCACATAGCA CAACTTTAAA AGTGCTCATC 7450
ATTGGAAAAC GTTCTTCGGG GCGAAAACTC TCAAGGATCT TACCGCTGTT 7500
GAGATCCAGT TCGATGTAAC CCACTCGTGC ACCCAACTGA TCTTCAGCAT 7550
CTTTTACTTT GACCAGCGTT TCTGGGTCAG CAAAAACAGG AAGGCAAAAT 7600
GCCGCAAAAA AGGGAATAAG GGCGACACGG AAATGTTGAA TACTCATACT 7650
CTTCCTTTTT CAATATTATT GAAGCATTTA TCAGGGTTAT TGTCTCATGA 7700
GCGGATACAT ATTTGAATGT ATTTAGAAAA ATAAACAAAT AGGGGTTCCG 7750
CGCACATTTC CCCGAAAAGT GCCACCTGAC GTCTAAGAAA CCATTATTAT 7800
CATGACATTA ACCTATAAAA ATAGGCGTAT CACGAGGCCC            7850
AA 7852



(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9972 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
TCTTCCGCTT CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG 50 
GCGAGCGGTA TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT 100
CAGGGGATAA CGCAGGAAAG AACATGTGAG CAAAAGGCCA GCAAAAGGCC 150
AGGAACCGTA AAAAGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC 200
CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC 250
CGACAGGACT ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG 300

CGCTCTCCTG TTCCGACCCT GCCGCTTACC GGATACCTGT CCGCCTTTCT 350 

CCCTTCGGGA AGCGTGGCGC TTTCTCATAG CTCACGCTGT AGGTATCTCA 400 

GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA CGAACCCCCC 450 

GTTCAGCCCG ACCGCTGCGC CTTATCCGGT AACTATCGTC TTGAGTCCAA 500 

CCCGGTAAGA CACGACTTAT CGCCACTGGC AGCAGCCACT GGTAACAGGA 550 

TTAGCAGAGC GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG 600 

CCTAACTACG GCTACACTAG AAGAACAGTA TTTGGTATCT GCGCTCTGCT 650 

GAAGCCAGTT ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC 700 

AAACCACCGC TGGTAGCGGT GGTTTTTTTG TTTGCAAGCA GCAGATTACG 750 

CGCAGAAAAA AAGGATCTCA AGAAGATCCT TTGATCCTTO CTACGGGGTC 800 
TGACGCTCAG TGGAACGAAA ACTCACGTTA AGGGATTTTG GTCATGAGAT 850 
TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT 900 
AAATCAATCT AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG 950 

CTTAATCAGT GAGGCACCTA TCTCAGCGAT CTGTCTATTT CGTTCATCCA 1000 

TAGTTGCCTG ACTCCCCGTC GTGTAGATAA CTACGATACG GGAGGGCTTA 1050 

CCATCTGGCC CCAGTGCTGC AATGATACCG CCAGACCCAC GCTCACCGGC 1100 

TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA 1150 

GTGGTCCTGC AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG 1200 

GAAGCTAGAG TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC 1250 

CATTGCTACA GGCATCGTGG TGTCACGCTC GTCGTTTGGT ATGGCTTCAT 1300 

TCAGCTCCGC TTCCCAACGA TCAAGGCSAG TTACATGATC CCCCATC5TTG 1350 

TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT CCGATCGTTG TCAGAAGTAA 1400
GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC 1450
TTACTGTCAT GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA 1500
ACCAAGTCAT TCTGAGAATA GTGTATGCGG CGACCGAGTT GCTCTTGCCC 1550
GGCGTCAATA CGGGATAATA CCGCGCCACA TAGCAGAACT TTAAAAGTGC 1600
TCATCATTGG AAAACGTTCT TCGGGGCGAA AACTCTCAAG GATCTTACCG 1650
CTGTTGAGAT CCAGTTCGAT GTAACCCACT CGTGCACCCA ACTGATCTTC 1700
AGCATCTTTT ACTTTCACCA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC 1750
AAAATGCCGC AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC 1800
ATACTCTTCC TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT 1850
CATGAGCGGA TACATATTTG AATGTATTTA GAAAAATAAA CAAATAGGGG 1900
TTCCGCGCAC ATTTCCCCGA AAAGTGCCAC CTGACGTCTA AGAAACCATT 1950
ATTATCATGA CATTAACCTA TAAAAATAGG CGTATCACGA GGCCCTTTCG 2000
TCTCGCGCGT TTCGGTGATG ACGGTGAAAA CCTCTGACAC ATGCAGCTCC 2050
CGGAGACGGT CACAGCTTGT CTGTAAGCGG ATGCCGGGAG CAGACAAGCC 2100
CGTCAGGGCG CGTCAGCGGG TGTTGGCGGG TGTCGGGGCT GGCTTAACTA 2150
TGCGGCATCA GAGCAGATTG TACTGAGAGT GCACCATAAA ATTGTAAACG 2200
TTAATATTTT GTTAAAATTC GCGTTAAATT TTTGTTAAAT CAGCTGATTT 2250
TTTAACCAAT AGGCCGAAAT CGGCAAAATC CCTTATAAAT GAAAAGAATA 2300
GCCCGAGATA GGGTTGAGTG TTGTTCCAGT TTGGAACAAG AGTCCACTAT 2350
TAAAGAACGT GGACTCCAAC GTCAAAGGGC GAAAAACCGT CTATCAGGGC 2400
GATGGCCCAC TACGTGAACC ATCACCCAAA TCAAGTTTTT TGGGGTCGAG 2450
GTGCCGTAAA GCACTAAATC GGAACCCTAA AGGGAGCCCC CGATTTAGAG 2500
CTTGACGGGG AAAGCCGGCG AACGTGGCGA GAAAGGAAGG GAAGAAAGCG 2550
AAAGGAGCGG GCGCTAGGGC GCTGGCAAGT GTAGCGGTCA CGCTGCGCGT 2600
AACCACCACA CCCGCCGCGC TTAATGCGCC GCTACAGGGC GCGTACTATG 2650
GTTGCTTTGA CGTATGCGGT GTGAAATACC GCACAGATGC GTAAGGAGAA 2700
AATACCGCAT CAGGCGCCAT TCGCCATTCA GGCTGCGCAA CTGTTGGGAA 2750

GGGCGATCGG TGCGGGCCTC TTCGCTATTA CGCCAGCTGG CGAAAGGGGG 2800 

ATGTGCTGCA AGGCGATTAA GTTGGGTAAC GCCAGGGTTT TCCCAGTCAC 2850 

GACGTTGTAA AACGACGGCC AGTGCCAAGC 1VAAGGTGCA CGGCCCACGT 2900 

GGCCACTAGT ACTTCTCGAG CTCTGTACAT GTCCGCGGTC GCGACGTACG 2950 

CGTATCGATG GCGCCAGCTG CAGGCGGCCG CCATATGCAT CCTAGGCCTA 3000 

TTAATATTCC GGAGTATACG TAGCCGGCTA ACGTTAACAA CCGGTACCTC 3050 

TAGAACTATA GCTAGCCAAT TCCATCATCA ATAATATACC TTATTTTGGA 3100 

TTGAAGCCAA TATGATAATG AGCGGGTGGA GTTTGTGACG TGGCGCGGGG 3150 

CGTGGGAACG GGGCGGGTGA CGTAGGTTTT AGGGCGGAGT AACTTGTATG 3200 

TGTTGGGAAT TGTAGTTTTC TTAAAATGGG AAGTTACGTA ACGTGGGAAA 3250 

ACGGAAGTGA CGATTTGAGG AAGTTGTGGG TTTTTTGGCT TTCGTTTCTC 3300 

GGCGTAGGTT CGCGTGCGGT TTTCTGGGTG TTTTTTGTGG ACTTTAACCG 3350 

TTACGTCATT TTTTAGTCCT ATATATACTC GCTCTGCACT TGGCCCTTTT 3400 

TTACACTGTG ACTGATTGAG CTGGTGCCGT GTCGAGTGGT GTTTTTTTAA 3450 

TAGGTTTTCT TTTTTACTGG TAAGGCTGAC TGTTAGGCTG CCGCTGTGAA 3500 

GCGCTGTATG TTGTTCTGGA GCGGGAGGGT GCTATTTTGC CTAGGCAGGA 3550 

GGGTTTTTCA GGTGTTTATG TGTTTTTCTC TCCTATTAAT TTTGTTATAC 3600 

CTCCTATGGG GGCTGTAATG TTGTCTCTAC GCCTGCGGGT ATGTATTCCC 3650 

CCCAAGCTTG CATGCCTGCA GGTCGACTCT AGAGGATCCG AAAAAACCTC 3700 

CCACACCTCC CCCTGAACCT GAAACATAAA ATGAATGCAA TTGTTGTTGT 3750 

TAACTTGTTT ATTGCAGCTT ATAATGGTTA CAAATAAAGC AATAGCATCA 3800 

CAAATTTCAC AAATAAAGCA TTTTTTTCAC TGCATTCTAG TTGTGGTTTG 3850 
TCCAAACTCA TCAATGTATC TTATCATGTC TGGATCCCCC TAGCTTGCCA 3900

AACCTACAGG TGGGGTCTTT CATTCCCCCC TTTfECTGGA GACTAAATAA 3950

AATCTTTTAT TTTATCTATG GCTCGTACTC TATAGGCTTC AGCTGGTGAT 4000

ATTGTTGAGT CAAAACTAGA GCCTGGACCA CTGATATCCT GTCTTTAACA 4050

AATTGGACTA ATCGCGGGAT CAGCCAATTC CATGAGCAAA TGTCCCATGT 4100 

CAACATTTAT GCTGCTCTCT AAAGCCTTGT ATCTTGCATC TCTTCTTCTG 4150 

TCTCCTCTTT CAGAGCAGCA ATCTGGGGCT TAGACTTGCA CTTGCTTGAG 4200

TTCCGGTGGG GAAAGAGCTT CACCCTGTCG GAGGGGCTGA TGGCTTGCCG 4250

GAAGAGGCTC CTCTCGTTCA GCAGTTTCTG GATGGAATCG TACTGCCGCA 4300 

CTTTGTTCTC TTCTATGACC AAAAATTGTT GGCATTCCAG CATTGCTTCT 4350 

ATCCTGTGTT CACAGAGAAT TACTGTGCAA TCAGCAAATG CTTGTTTTAG 4400 

AGTTCTTCTA ATTATTTGGT ATGTTACTGG ATCCAAATGA GCACTGGGTT 4450 

CATCAAGCAG CAAGATCTTC GCCTTACTGA GAACAGATCT AGCCAAGCAC 4500 

ATCAACTGCT TGTGGCCATG GCTTAGGACA CAGCCCCCAT CCACAAGGAC 4550 

AAAGTCAAGC TTCCCAGGAA ACTGTTCTAT CACAGATCTG AGCCCAACCT 4600 

CATCTGCAAC TTTCCATATT TCTTGATCAC TCCACTGTTC ATAGGGATCC 4650 

AAGTTTTTTC TAAATGTTCC AGAAAAAATA AATACTTTCT GTGGTATCAC 4700 

TCCAAAGGCT TTCCTCCACT GTTGCAAAGT TATTGAATCC CAAGACACAC 4750 

CATCGATCTG GATTTCTCCT TCAGTGTTCA GTAGTCTCAA AAAAGCTGAT 4800 

AACAAAGTAC TCTTCCCTGA TCCAGTTCTT CCCAAGAGGC CCACCCTCTG 4850 

GCCAGGACTT ATTGAGAAGG AAATGTTCTC TAATATGGCA TTTCCACCTT 4900 

CTGTGTATTT TGCTGTGAGA TCTTTGACAG TCATTTGGCC CCCTGAGGGC 4950 

CAGATGTCAT CTTTCTTCAC GTGTGAATTC TCAATAATCA TAACTTTCGA 5000 

GAGTTGGCCA TTCTTGTATG GTTTGGTTGA CTTGGTAGGT TTACCTTCTG 5050 

TTGGCATGTC AATGAACTTA AAGACTCGGC TCACAGATCG CATCAAGCTA 5100 

TCCACATCTA TGCTGGAGTT TACAGCCCAC TGCAATGTAC TCATGATATT 5150 

CATGGCTAAA GTCAGGATAA TACCAACTCT TCCTTCTCCT TCTCCTGTTG 5200 

TTAAAATGGA AATGAACGTA ACAGCAATGA AGAAGATGAC AAAAATCATT 5250 

TCTATTCTCA TTTGGAACCA GCGCAGTGTT GACAGGTACA AGAACCAGTT 5300 
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GGCAGTATGT AAATTCAGAG CTTTGTGGAA CAGAGTTTCA AAGTAAGGCT 5350 

GCCGTCCGAA GGCACGAAGT GTCCATAGTC CTTTTAAGCT TGTAACAAGA 5400 

TGAGTGAAAA TTGGACTCCT GCCTTCAGAT TCCAGTTGTT TGAGTTGCTG 5450 

TGAGGTTTGG AGGAAATATG CTCTCAACAT AATAAAAGCC ACTATCACTG 5500 

GCACTGTTGC AACAAAGATG TAGGGTTGTA AAACTGCGAC AACTGCTATA 5550 

GCTCCAATCA CAATTAATAA CAACTGGATG AAGTCAAATA TGGTAAGAGG 5600 

CAGAAGGTCA TCCAAAATTG CTATATCTTT GGAGAATCTA TTAAGAATCC 5650 

CACCTGCTTT CAACGTGTTG AGGGTTGACA TAGGTGCTTG AAGAACAGAA 5700 

TGTAACATTT TGTGGTGTAA AATTTTCGAC ACTGTGATTA GAGTATGCAC 5750 

CAGTGGTAGA CCTCTGAAGA ATCCCATAGC AAGCAAAGTG TCGGCTACTC 5800 

CCACGTAAAT GTAAAACACA TAATACGAAC TGGTGCTGGT GATAATCACT 5850 

GCATAGCTGT TATTTCTACT ATGAGTACTA TTCCCTTTGT CTTGAAGAGG 5900 

AGTGTTTCCA AGGAGCCACA GCACAACCAA AGAAGCAGCC ACCTCTGCCA 5950 

GAAAAATTAC TAAGCACCAA ATTAGCACAA AAATTAAGCT CTTGTGGACA 6000 

GTAATATATC GAAGGTATGT GTTCCATGTA GTCACTGCTG GTATGCTCTC 6050 

CATATCATCA AAAAAGCACT CCTTTAAGTC TTCTTCGTTA ATTTCTTCAC 6100 

TTATTTCCAA GCCAGTTTCT TGAGATAACC TTCTTGAATA TATATCCAGT 6150 

TCAGTCAAGT TTGCCTGAGG GGCCAGTGAC ACTTTTCGTG TGGATGCTGT 6200 

TGTCTTTCGG TGAATGTTCT GACCTTGGTT AACTGAGTGT GTCATCAGGT 6250 

TCAGGACAGA CTGCCTCCTT CGTGCCTGAA GCGTGGGGCC AGTGCTGATC 6300 

ACGCTGATGC GAGGCAGTAT CGCCTCTCCC TGCTCAGAAT CTGGTACTAA 6350 

GGACAGCCTT CTCTCTAAAG GCTCATCAGA ATCCTCTTCG ATGCCATTCA 6400

TTTGTAAGGG AGTCTTTTGC ACAATGGAAA ATTTTCGTAT AGAGTTGATT 6450 

GGATTGAGAA TAGAATTCTT CCTTTTTTCC CCAAACTCTC CAGTCTGTTT 6500 

AAAAGATTGT TTTTTTGTTT CTGTCCAGGA GACAGGAGCA TCTCCTTCTA 6550

ATGAGAAACG GTGTAAGGTC TCAGTTAGGA TTGAATTTCT TCTTTCTGCA 6600 
CTAAATTGGT CGAAAGAATC ACATCCCATG AGTTTTGAGC TAAAGTCTGG 6650
CTGTAGATTT TGGAGTTCTG AAAATGTCCC ATAAAAATAG CTGCTACCTT 6700
CATGCAAAAT TAATATTTTG TCAGCTTTCT TTAAATGTTC CATTTTAGAA 6750
GTGACCAAAA TCCTAGTTTT GTTAGCCATC AGTTTACAGA CACAGCTTTC 6800
AAATATTTCT TTTTCTGTTA AAACATCTAG GTATCCAAAA GGAGAGTCTA 6850
ATAAATACAA ATCAGCATCT TTGTATACTG CTCTTGCTAA AGAAATTCTT 6900
GCTCGTTGAC CTCCACTCAG TGTGATTCCA CCTTCTCCAA GAACTATATT 6950
GTCTTTCTCT GCAAACTTGG AGATGTCCTC TTCTAGTTGG CATGCTTTGA 7000
TGACGCTTCT GTATCTATAT TCATCATAGG AAACACCAAA GATGATATTT 7050
TCTTTAATGG TGCCAGGCAT AATCCAGGAA AACTGAGAAC AGAATGAAAT 7100
TCTTCCACTG TGCTTAATTT TACCCTCTGA AGGCTCCAGT TCTCCCATAA 7150
TCATCATTAG AAGTGAAGTC TTGCCTGCTC CAGTGGATCC AGCAACCGCC 7200
AACAACTGTC CTCTTTCTAT CTTGAAATTA ATATCTTTCA GGACAGGAGT 7250
ACCAAGAAGT GAGAAATTAC TGAAGAAGAG GCTGTCATCA CCATTAGAAG 7300
TTTTTCTATT GTTATTGTTT TGTTTTGCTT TCTCAAATAA TTCCCCAAAT 7350
CCCTCCTCCC AGAAGGCTGT TACATTCTCC ATCACTACTT CTGTAGTCGT 7400
TAAGTTATAT TCCAATGTCT TATATTCTTG CTTTTGTAAG AAATCCTGTA 7450
TTTTGTTTAT TGCTCCAAGA GAGTCATACC ATGTTTGTAC AGCCCAGGGA 7500
AATTGCCGAG TGACCGCCAT GCGCAGAACA ATGCAGAATG AGATGGTGGT 7550
GAATATTTTC CGGAGGATGA TTCCTTTGAT TAGTGCATAG GGAAGCACAG 7600
ATAAAAACAC CACAAAGAAC CCTGAGAAGA AGAAGGCTGA GCTATTGAAG 7650
TATCTCACAT AGGCTGCCTT CCGAGTCAGT TTCAGTTCTG TTTGTCTTAA 7700
GTTTTCAATC ATTTTTTCCA TTGCTTCTTC CCAGCAGTAT GCCTTAACAG 7750
ATTGGATGTT CTCGATCATT TCTGAGGTAA TCACAAGTCT TTCACTGATC 7800
TTCCCAGCTC TCTGATCTCT GTACTTCATC ATCATTCTCC CTAGCCCAGC 7850
CTGAAAAAGG GCAAGGACTA TCAGGAAACC AAGTCCACAG AAGGCAGACG 7900
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CCTGTAACAA CTCCCAGATT AGCCCCATGA GGAGTGCCAC TTGCAAAGGA 7950 
GCGATCCACA CGAAATGTGC CAATGCAAGT CCTTCATCAA ATTTGTTCAG 8000 
GTTGTTGGAA AGGAGACTAA CAAGTTGTCC AATACTTATT TTATCTAGAA 8050 
CACGGCTTGA CAGCTTTAAA GTCTTCTTAT AAATCAAACT AAACATAGCT 8100 
ATTCTCATCT GCATTCCAAT GTGATGAAGG CCAAAAATGG CTGGGTGTAG 8150 
GAGCAGTGTC CTCACAATAA AGAGAAGGCA TAAGCCTATG CCTAGATAAA 8200
TCGCGATAGA GCGTTCCTCC TTGTTATCCG GGTCATAGGA AGCTATGATT 8250
CTTCCCAGTA AGAGAGGCTG TACTGCTTTG GTGACTTCCC CTAAATATAA 8300
AAAGATTCCA TAGAACATAA ATCTCCAGAA AAAACATCGC CGAAGGGCAT 8350
TAATGAGTTT AGGATTTTTC TTTGAAGCCA GCTCTCTATC CCATTCTCTT 8400
TCCAATTTTT CAGATAGATT GTCAGCAGAA TCAACAGAAG GGATTTGGTA 8450
TATGTCTGAC AATTCCAGGC GCTGTCTGTA TCCTTTCCTC AAAATTGGTC 8500
TGGTCCAGCT GAAAAAAAGT TTGGAGACAA CGCTGGCCTT TTCCAGAGGC 8550
GACCTCTGCA TGGTCTCTCG GGCGCTGGGG TCCCTGCTAG GGCCGTCTGG 8600
GCTCAAGCTC CTAATGCCAA AGGAATTCCT GCAGCCCGGG GGATCCACTA 8650
GTTCTAGAGC GGCCGCCACC GCGGTGGCTG ATCCCGCTCC CGCCCGCCGC 8700
GCGCTTCGCT TTTTATAGGG CCGCCGCCGC CGCCGCCTCG CCATAAAAGG 8750
AAACTTTCGG AGCGCGCCGC TCTGATTGGC TGCCGCCGCA CCTCTCCGCC 8800
TCGCCCCGCC CCGCCCCTCG CCCCGCCCCG CCCCGCCTGG CGCGCGCCCC 8850
CCCCCCCCCC CCGCCCCCAT CGCTGCACAA AATAATTAAA AAATAAATAA 8900
ATACAAAATT GGGGGTGGGG AGGGGGGGGA GATGGGGAGA GTGAAGCAGA 8950
ACGTGGCCTC GAGTAGATGT ACTGCCAAGT AGGAAAGTCC CATAAGGTCA 9000
TGTACTGGGC ATAATGCCAG GCGGGCCATT TACCGTCATT GACGTCAATA 9050
GGGGGCGTAC TTGGCATATG ATACACTTGA TGTACTGCCA AGTGGGCAGT 9100
TTACCGTAAA TACTCCACCC ATTGACGTCA ATGGAAAGTC CCTATTGGCG 9150
TTACTATGGG AACATACGTC ATTATTGACG TCAATGGGCG GGGGTCGTTG 9200
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GGCGGTCAGC CAGGCGGGCC ATTTACCGTA AGTTATGTAA CGACCTGCAG 9250 

GCTGATCTCC CTAGACAAAT ATTACGCGCT ATGAGTAACA CAAAATTATT 9300 

CAGATTTCAC TTCCTCTTAT TCAGTTTTCC CGCGAAAATG GCCAAATCTT 9350 

ACTCGGTTAC GCCCAAATTT ACTACAACAT CCGCCTAAAA CCGCGCGAAA 9400

ATTGTCACTT CCTGTGTACA CCGGCGCACA CCAAAAACGT CACTTTTGCC 9450 

ACATCCGTCG CTTACATGTG TTCCGCCACA CTTGCAACAT CACACTTCCG 9500 

CCACACTACT ACGTCACCCG CCCCGTTCCC ACGCCCCGCG CCACGTCACA 9550 

AACTCCACCC CCTCATTATC ATATTGGCTT CAATCCAAAA TAAGGTATAT 9600 

TATTGATGAT GCTAGCATGC GCAAATTTAA AGCGCTGATA TCGATCGCGC 9650 

GCAGATCTGT CATGATGATC ATTGCAATTG GATCCATATA TAGGGCCCGG 9700 

GTTATAATTA CCTCAGGTCG ACGTCCCATG GCCATTCGAA TTCGTAATCA 9750 

TGGTCATAGC TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACA 9800 

CAACATACGA GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAATGAG 9850 

TGAGCTAACT CACATTAATT GCGTTGCGCT CACTGCCCGC TTTCCAGTCG 9900 

GGAAACCTGT CGTGCCAGCT GCATTAATGA ATCGGCCAAC GCGCGGGGAG 9950 

AGGCGGTTTG CGTATTGGGC GC 9972 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
TAGTAAATTT GGGC 14 


(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGTAAGATTT GGCC 14


(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
AGTGAAATCT GAAT 14


(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GAATAATTTT GTGT 14


(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

CGTAATATTT GTCT 14 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

WANWTTTG 8 
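
The following is a minimal sketch, not part of the original listing, of how the degenerate 8-base sequence of SEQ ID NO: 9 can be compared against the 14-base sequences of SEQ ID NOS: 4-8. It assumes that SEQ ID NO: 9 is written in the standard IUPAC nucleotide code (W = A or T, N = any base); the function names are hypothetical and chosen only for illustration.

```python
import re

# Standard IUPAC nucleotide codes needed for this pattern
# (assumption: SEQ ID NO: 9 uses IUPAC ambiguity codes).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "[AT]", "N": "[ACGT]"}

# Sequences copied from the listing (SEQ ID NOS: 4-9), spaces removed.
CONSENSUS = "WANWTTTG"
SEQUENCES = {
    4: "TAGTAAATTTGGGC",
    5: "AGTAAGATTTGGCC",
    6: "AGTGAAATCTGAAT",
    7: "GAATAATTTTGTGT",
    8: "CGTAATATTTGTCT",
}


def iupac_to_regex(pattern):
    """Translate an IUPAC-coded pattern into a compiled regular expression."""
    return re.compile("".join(IUPAC[base] for base in pattern))


def first_match_positions(pattern, sequences):
    """Map each SEQ ID NO to the 0-based start of its first match, or None."""
    regex = iupac_to_regex(pattern)
    positions = {}
    for seq_id, seq in sequences.items():
        match = regex.search(seq)
        positions[seq_id] = match.start() if match else None
    return positions


if __name__ == "__main__":
    for seq_id, pos in first_match_positions(CONSENSUS, SEQUENCES).items():
        print(f"SEQ ID NO: {seq_id} -> {pos}")
```

Under this reading, SEQ ID NOS: 4, 5, 7 and 8 each contain an exact match beginning at the fourth base, while SEQ ID NO: 6 as printed (TCTG in place of TTTG) does not match exactly.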


(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19307 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

CCAATTCCAT CATCAATAAT ATACCTTATT TTGGATTGAA GCCAATATGA 50 

TAATGAGGGG GTGGAGTTTG TGACGTGGCG CGGGGCGTGG GAACGGGGCG 100 

GGTGACGTAG GTTTTAGGGC GGAGTAACTT GTATGTGTTG GGAATTGTAG 150 

TTTTCTTAAA ATGGGAAGTT ACGTAACGTG GGAAAACGGA AGTGACGATT 200 

TGAGGAAGTT GTGGGTTTTT TGGCTTTCGT TTCTGGGCGT AGGTTCGCGT 250 

GCGGTTTTCT GGGTGTTTTT TGTGGACTTT AACCGTTACG TCATTTTTTA 300 

GTCCTATATA TACTCGCTCT GCACTTGGCC CTTTTTTACA CTGTGACTGA 350 

TTGAGCTGGT GCCGTGTCGA GTGGTGTTTT TTTAATAGGT TTTCTTTTTT 400 
ACTGGTAAGG CTGACTGTTA GGCTGCCGCT GTGAAGCGCT GTATGTTGTT 450
CTGGAGCGGG AGGGTGCTAT TTTGCCTAGG CAGGAGGGTT TTTCAGGTGT 500
TTATGTGTTT TTCTCTCCTA TTAATTTTGT TATACCTCCT ATGGGGGCTG 550
TAATGTTGTC TCTACGCCTG CGGGTATGTA TTCCCCCCAA GCTTGCATGC 600
CTGCAGGTCG ACTCTAGAGG ATCCGAAAAA ACCTCCCACA CCTCCCCCTG 650
AACCTGAAAC ATAAAATGAA TGCAATTGTT GTTGTTAACT TGTTTATTGC 700
AGCTTATAAT GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA 750
AAGCATTTTT TTCACTGCAT TCTAGTTGTG GTTTGTCCAA ACTCATCAAT 800
GTATCTTATC ATGTCTGGAT CCCCGCGGCC GCTCTAGAAC TAGTGGATCC 850
CCCGGGCTGC AGGAATTCCG TAACATAACT GCGTGCTTTA TTGAGATACA 900
CAGTAAAGCA GTAATATAAT ACAATAGTAA GGCATATATT TGGTGAAATC 950
TGATATGTTG TGAAAATGCA GTAAAACTGA AGTTTAAAAA AATAATTAGT 1000
AAATGTTACA GTGTTGGTGT TAAAACACAA TCTATTATGA TACTCAAGTA 1050
AGAGTCCAGT ACCTGGAGAC AATGATGATA CATGCCATGT GATGATTATG 1100
CTTCAGTTAC ACTGATTATG ATTTACACTT TAATACTTGA TGGTTATAAA 1150
GAACATGAAA TGATGTCCAA ATTATGCTTA AAATCAGCAA TAAAGCTCTC 1200
AGTTTTTATT CAAATATTTT GATAGATTCA CTCCAGAACT AATATCTAAA 1250
AGATAAAACG AAAAGATTAA AACAAAACTA TGCACTCTAT CTACCTTGGA 1300
TTTTAGAATG AAACTTAAAA CTTCTTAGTA GGAAAGGAAC CCCTTGTTTT 1350
AAATCTTGGT GAAAACAAAT CCTTGGATAA AGAAAATGCC CAGTGCCACA 1400
TAAAGGAGAG AGAGAGAGAA AAGCAAGACC AGAACCAAAT TTCAATTTGT 1450
TATCTTAGAG CTTTGGGTTT TCTTTTGGAA ATTATAAATG AAAAAAGGAA 1500
ACTGGTGTCC ACACAACAGA CAAGTGGTGA AGTTGTGAAA TTAGGTGTGC 1550
ACAATTACTA GAAACACCCC AAAACCAAAG TGAGGTAGAA ATAGCATGAG 1600
AAGCTGTGTT TGATGTTAAT TACAATTAAT AATGGACAAA ACCCACTCGC 1650
TAGAAGTTAA TTACACTTGA CGTTAGAGGT AACAGATTTG CAAAATGATA 1700
GGACAGTGAT TTCTATTGAG AGAATGCTCT TTAAATGCTA AGAAGAAGAA 1750
ACTGGCATGA GAGGAGTAAA GCTCTTCCTA GCAGTCCTTA GCTTTCTGTT 1800
GCACTTTTTC TCCTGGTTCA ATGACTTGCA TTTGTTTAGA CATTTCAGCC 1850
CGTCAACTAG ACCAGAGAGT TTGGAGACGC TIJTGCTCTC AAAACTTTCC 1900
AACCACTGTG CCTTCTCACC CACAATCCTG TGTGGAGTTA CTTGCAGGGA 1950
AACCAATGCA AAGGAGACAA ATGCAGTTCA TGGGCTTCTG GACTGATATT 2000
CACCAGGGTC ACAATGTGAT TGGGTTACTT TCTTAACAGT AATCCTAAGT 2050
CTTGCAGCAT TAAAAAAAAA AATCATCACA ATGAAGAAAA AAAAACCCAA 2100
AAAATCTAAA ATCTAAAATT CATCATCATC ATCAACAACA ACAACAACAA 2150
CAACAACAAA ACCACCCACT TCAGGTTGAG TTTATGAAGA GGGCAGAACA 2200
ATTTAGTTGT AATTATAGAG ATGTTTATAT GTATAGTTGT AAATATTCAT 2250
CCATTCTTTT ACAGAGTTGT TGCTCCCCTC ATATAAATTG ACTGAGGAGC 2300
CGCAACCTTT AGCTCCTACC ATCTTCCTCC TACTGTCTGG GAGTTAAAAA 2350
TGTCATCTGA TGTTCTATTG CAGAAACATC ATTAAATATA ACCCAACAGT 2400
AGGAAGTTGA ATATATCAGC CAACAAATTA CTATGATAGT AAGTCCTGTG 2450
TATTCATTCG CATGTTCCTT GAAAAAAATG AATCCTCTAG CTCTCAGTGG 2500
AAAGTTTAAA ACTAGAAACA TCTGGAGCCC TAGACAATAT TTTAGTGTGG 2550
CGGTAGTCTC CTGGCTTTGG GCTCCAGGGA AAATTCACTC TTGCCCAAGC 2600
AGATAAGCCC AGATGACTAG AAGCAATTTC CATTAGGAAG TGGCAAGAAC 2650
ATTTGAAGAA GTAACTTCAT ATCTATTTAT CTATATACCT ATAGTATTTA 2700
TATACTTGTA GACATATAGA TGTATAAAAT GAAAGCCCAT AGCCAGCCCC 2750
ACTCAGTCAA CAATTCTCAA AAGAGCAATA TGAAGCAGTC ATTTGGTGGG 2800
GTTCGTATGC AAGAAAATAA AAAAACGTCA TGAATTCCAT ATGAATACCA 2850
CGCTAAAGTA ATGCAAAACA ATGTGCTGCC TCAGTGTGTG TGTGTGTGTG 2900
TGTGTGTGTG GTGGGTTCGT GCATGTATGT GTGCGTGTGT GTGTGTGTGT 2950
GTGTGTGTGT GTGTGTGTGC GTGTGTGTTT GTTTAGGGGT TTTTATAAAC 3000
AACTTTTTTT ATAAAGCACA CTTTAGTTTA CAATCTCTCT TTATAACTGT 3050
TATAAATTTT TAAACAACCC AAAATGCGTT CCATATAAAG AAATGGCAAG 3100
TTATTTAGCT ATCAAGATTT TACATGTTTT CTTTTAACTT TTTTGTACAA 3150
TTGCATAGAC GTGTAAAACC TGCCATTGTT AACAAAACAA TAACAGACTT 3200
AGAAACTACT GAAATCTACA GTATAGTACC ACTACCCTTC ACAAAAATAT 3250
AGATTTTATT TCTTGTAAAC TCTTACTGTC TAATCCTCTT TGTTGTACGA 3300
ATATTATAAA AACCATGCGG GAATCAGGAG TTGTAAAACA TTTATTCTGC 3350
TCCTTCTTCA TCTGTCATGA CTGAAACTAA GGACTCCATC GCTCTGCCCA 3400
AATCATCTGC CATGTGGAAA AGGCTTCCTA CATTGTGTCC TCTCTCATTG 3450
GCTTTCCGGG GGCATTTCTT CCTCTTGAAC TAGGGAAGGA GTTGTTGAGT 3500
TGCTCCATCA CTTCTTCTAA CCCTGTGCTT GTGTCCTGGG GAGGACTCAG 3550
AAGATCTTCC TCACCCATAG ATTCTGAAGT TTGACTGCCA ACCACTCGGA 3600
GCAGCATAGG CTGACTGCTA TCTGACCTCT GCAGAGAGGT GGAAGGAGAG 3650
GACACCGTGG TGCCATTCAC GTTAGCTTCA GCCTGGGGCT GCTCCAGGAG 3700
CTGTCTCAGT CTATGTAACT GAGACTCCAG CTGTTTATTG TGGTCTTCCA 3750
GGATTTGCAT CCTGGCTTCC AGGCGTCCTT TGTGTTGGCG CAGTAGCTTA 3800
GCCTCAGCAA TGAGCTCAGC ATCCCTGGGA CTCTGAGGAG AGGTGGGCAT 3850
CATCTCAGGA GGAGATGGCA GTGGAGACAG GCCTTTATGC TCATGCTGCT 3900
GCTTCAGGCG ATCATATTCT GCTTGCAGAT TCCTGTTTTC TTCCTCAAGA 3950
TCTGCTAGGA TTCTCTCTAG CTCCCCTCTT TCCTCACTCT CTAAGGAAAT 4000
CAAGATCTGG GCAGGACTAC GAGGCTGGCT CAGGGGGGAG TCCTGGTTCA 4050
AACTTTGGCA GTAATGCTGG ATTAACAAAT GTTCATCATC TATGCTCTCA 4100
TTAGGAGAGA TGCTATCATT TAGATAAGAT CCATTGCTGT TTTCCATTTC 4150
TGCTAGCCTG CTAGCATAAT GTTCAATGCG TGAATGAGTA TCATCGTGTG 4200
AAAGCTGGGG GGACGAGGCA GGCGCAGAAT CTACTGGCCA GAAGTTGATC 4250
AGAGTAACGG GAGTTTCCAT GTTGTCCCCC TCTAACACAG TCTGCACTGG 4300
CAGGTAGCCC ATTCGGGGAT GCTTCGCAAA ATACCTTTTG GTTCGAAATT 4350
TGTTTTTTAG TACCTTGGCG AAGTCGCGAA CATCTTCTCC GGATGTAGTC 4400
GGAGTGCAAT ACTCTACCAT GGGGTAGTGC ATTTTATGGC CCTTTGCAAC 4450
TCGGCCAGAA AAAAAGCAAC TTTGGCAGAT GTuATAATTA AAATGCTTTA 4500
GGCTTCTGTA CCTGAATCCA ATGATTGGAC ACTCCTTACA GATGTTACAC 4550
TTGGCTTGAT GCTTGGCAGT TTCAGCAGCA GCCACTCTGT GCAAGACGGG 4600
CAGCCACACC ATAGACTGGG GTTCCAGGCG CATCCAGTCA AGGAAGAGAG 4650
CAGCTTCAAT CTCAGGTTTA TTATTGGCAA ATTGGAAGCA GCTCCTGACA 4700
CTCGGCTCAA TGTTACTGCC CCCAAAGGAA GCAACTTCAC CCAACTGTCT 4750
TGGGATTTGA ATAGAATCAT GCAGAAGAAG ACCCAGCCTA CGGTGGTCAC 4800
AAAAGCCAGT TGAACTTGCC ACTTGCTTGA AAAGGTATCT GTACTTGTCT 4850
TCCAAGTGTG CTTTACACAG AGAAATGATG CCAGTTTTAA AAGACAGGAC 4900
ACGGATCCTC CCTGTTCGTC CCGTATCATA AACATTGAGA AGCCAGTTGA 4950
GACAGATATC CACACAGAGA GGGACATTGA CCAGATTGTT GTGCTCTTGC 5000
TCCAGACGAT CATAAATTGT AGTCAAACAG TTAATTATCT GCAGGATATC 5050
CATGGGCTGG TCATTTTGCT TGAGGTTGTG CTGGTCCAGG GCATCACATG 5100
CAGCTGACAG GCTCAAGAGA TCCAAGCAAA GGGCCTTCTG GAGCCTTCTG 5150
AGCTTCATGG CAGTCCTATA CGCGGAGAAC CTGACATTAT TCAGGTCAGC 5200
TAAAGACTGG TAGAGCTCTG TCATTTTGGG GTGGTCCCAA CAAGTGGTTT 5250
GGGTCTCGTG GTTGATATAG TAGGGCACTT TGTTTGGTGA GATGGCTCTC 5300
TCCCAGGGAC CCTGAACTGA AGTGGAAAGG AAGTGCTGGG ATGCAGGACC 5350
AAAGTCCCTG TGGGCTTCAT GCAGCTGTCT GACACGGTCC TCCACAGCCA 5400
CCTGTAGAAG CCTCCATCTG GTATTCAGAT CTTCCAAAGT GCTGAGGTTA 5450
TAAGGTGAGA GCTGAATGCC CAGTGTGGTC AGCTGATGTG CAAGGTCATT 5500
GACACGATTG ACATTCTCTT TAAGAGGTGC AATTTCTCCC CGAAGTGCCT 5550
TGACTTTTTC AAGGTGATCT TGCAGAGAGT CAATGAGGAG ATCCCCCACT 5600
GGCTGCCAGG ATCCCTTGAT CACCTCAGCT TGGCGCAACT TGAGGTCCAG 5650
TTCATCGGCA GCTTCCTGAA GTTCCTGGAG TCTTTCAAGA GCTTCATCTA 5700
TTTTTCTCTG CCAATCAGCT GAGCGCAGGT TCAATTTGTC CCATTCAGCG 5750
TTGACCTCTT CAGCCTGCTT TCGTAGGAGC OjAGTGACAT TCTGAGCTCT 5800
TTCTTCAGGA GGCAGTTCTC TGGGCTCCTG GTAGAGTTTC TCTAGTCCTT 5850
CCAAAGGCTG CTCTGTCAGA AATATTCTCA CAGTCTCCAG AGTACTCATG 5900
ATTACAGGTT CTTTAGTTTT CAATTCCCTC TTGAAGGCCC TATGTATATC 5950
ATTCTGCTTC TGAACTGCTG GGAAATCACC ACCGATGGGT GCCTGACGGC 6000
TCAGTTCATC ATCTTTCAGC TGTAGCCAAA CAAGAAGTTC CTGAAGAGAA 6050
AGATGCAAAC GCTTCCACTG GTCAGAACTT GCTTCCAAAT GGGACCTAAT 6100
GTTGAGAGAC TTTTTCTGAA GTTCACTCCA CTTGAAATTC ATGTTATCCA 6150
AACGTCTTTG TAACAGGGGT GCTTCATCCG AACCTTCCAG GGATCTCAGG 6200
ATTTTTTGGC CATTTTCATC AAGATTGTGA TAGATATCTG TGTGAGTTTC 6250
AATTTCTCCT TGGAGATCTT GCCATGGTTT CATCAGCTCT CTGACTCCCC 6300
TGGAGTCTTC TAGGAGCTTC TCCTTACGGG AAGCGTCCTG TAGGACATTG 6350
GCAGTTGTTT CTGCTTCCGT AATCCAGGAA AGAAACTTCT CCAGGTCCAG 6400
AGGGAACTGC TGCAGTAATC TATGAGTTTC TTCCAAAGCA GCCTCTTGCT 6450
CACTTACTCT TTTATGAATC TTTCCCCAAG AAGTATTGAT ATTCTCTGTT 6500
ATCATGTGTA CTTTTCTGGT ATCATCAGCA GAATAGTCCC GAAGAAGTTT 6550
CAGTGCCAAA TCATTTGCCA CGTCTACACT TATCTGCCGT TGACGGAGGT 6600
CTTTGGCCAA CTGCTTGGTT TCTGTGATCT TCTTTTGGAT TGCATCTACT 6650
GTGTGAGGAC CTTCTTTCCA TGAGTCAAGC TTGCCTCTGA CCTGTCCTAT 6700
GACCTGTTCG GCTTCTTCCT TAGCTTCCAG CCATTGTGTT GAATCCTTTA 6750
ACATTTCATT CAACTGTTGT CTCCTGTTCT GCAGCTGTTC TTGAACCTCA 6800
TCCCACTGAA TCTGAATTCT TTCAATTCGA TCAGTAATGA TTGTTCTAGC 6850
TTCTTGATTG CTGGTTTTCT TTTTCAAATT CTGGGCAGCA GTAATGAGTT 6900
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CTTCCAATTG GGGGCGTCTC TGTTCCAAAT CTTGCAGTGT TGCCTTCTGT 6950 

TTGATGATCA TTTCATTGAT GTCTTCCAGA TCACCCACCA TCACTCTCTG 7000 

TGATTTTATA ACTCGATCAA GCAGAGACAG CCAGTCTGTA AGTTCTGTCC 7050

AAGCTCGGTT GAAGTCTGCC AGTGCAGGTA CC.CCAACAG CAAAGAAGAT 7100 

GGCATTTCTA GTTTGGAGAT GACAGTTTCC TTAGTAACCA CAGATTGTGT 7150 

CACTAGAGTA ACAGTCTGAC TGGCAGAGGC TCCAGTAGTG CTCAGTCCAG 7200 

GGGCACGGTC AGGCTGCTTT GTCCTCAGCT CCCGAAGTAA ATGGTTTACA 7250 

GCCTCCCACT CAGACCTCAG ATCTTCTAAC TTCCTCTTCA CTGGCTGAGT 7300 

GCTTGGTTTT TCCTTATACA AATGCTGCCC TTTCGACAAA AGCCTTTCCA 7350 

CATCCGCTTG TTTACCGTGA ACTGTTACTT CAATCTCCTT TATGTCAAAC 7400 

GGTCCTGCCT GACTTGGTTG GTTATAAATT TCCAACTGGT TTCTAATAGG 7450 

AGAGACCCAC AGAAGCAGGT GATCCAGCTG CTCTTCAAGC TGCCTAAAAT 7500 

CTTTTAAGTG AACCTCAAGC TCTCCTTGTT TCTCAGGTAA AGCTCTGGAG 7550 

ACCTTTATCC ACTGGAGATT TGTCTGTTTG AGCTTCTTTT CAAGTTTATC 7600 

TTGCTCTTCT GGCCTTATGG GAGCACTTAC AAGTACTGCT CCTCCTGTTT 7650 

CATTTAATTG TTTTAGAATT CCCTGGCGCA GGGGCAACTC TTCTGCCAGT 7700 

AACTTGACTT GTTCAAGTTG TTCTTTTAGC TGCTGCTCAT CTCCAAGTGG 7750 

AGTAATAGCA ATGTTATCTG CTTCTTCCAG CCACAAAACA AATTCATTTA 7800 

AATCTCTTTG AAATTCTGAC AAGACATTCT TTTGTTCTTC AATCCTCTTT 7850 

CTCCTTTCTG CCAGCTCTTT GCAGATGTCG TGCCACCGCA GACTCAAGCT 7900 

TCCTAATTTT TCTTGTAGAA TATTGACATC TGTTTTTGAA GACTGTTGAA 7950 

TTATTTCTTC CCCAGTTGCA TTCAGTGTTC TGACAACAGC TTGACGCTGC 8000 

CCAATGCCAT CCTGGAGTTC CTTAAGATAC CATTTGTATT TAGCATGTTC 8050 

CCAGTTTTCA GGATTTTGTG TCTTTTTGAA AAACTGTTCA ACTTCATTCA 8100 

GCCATTGATT AAATACCTTC ATATCATAAT GAAAGTGTCG CCATTTTTCA 8150 

ACTGATCTGT CGAATCGCCC TTGTCGTTCC TTGTACATTC TATGAAGTTT 8200 
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TTCCCCCTGG AAATCCATCT GTGCCACGGC TTCCTGTACT TTCACCTTTT 8250 
CCATGGAGGT GGCACTTTGC AAGGCTGCTG TCTTCTTCTT GTGAATAATA 8300 
TCAATCCGAC CTGAGATTTG TTGCAAATTG TCTTTTATAT TCTTAAGAGA 8350 
CTCCTCTTGC TTAAAAAGAT CTTCAAAATC T.TAGCACAG AGTTCAGGAG 8400 
TATTTAGAAG ATGATCAACT TCTGAAAGAG CTTGTAAGAT ATGACTGATC 8450 
TCGGTCAAAT AAGTAGAAGG CACATAAGAA ACATCCAAAG GCATATCTTC 8500 
AGTCGTCACT ACCATAGTTT CTTCATGGAG AGTGTGAATT TGTGCAAAGT 8550 
TGAGTCTTCG AAACTGAGCA AAATTGCTCT CAATTTGCCG CCAGCGCTTG 8600 
CTGAGCTGGA TCTGAGTTGG CTCCACTGCC ATTGCGGCCC CATTCTCAGA 8650 
CAAGCCCTCA GCTTGCCTGC GCACTGCATT CAGCTCCTCT TTCTTCTTCT 8700 
GCAATTCACG ATCAATTTCC TTTAATTTTC TTTCATCTCT GGGTTCAGGT 8750 
AGGCTGGCTA ATTTTTTTTC AATTTCATCC AAGCATTTCA GGAGATCATC 8800 
AGCCTGCCTC TTGTACTGAT ACCACTGGTG AGAAATTTCT AGGGCCTTTT 8850 
TTOTCTTTG AGACCTCAAA TCCTTGAGAG CATTATGTTT TGTCTGTAAC 8900 
AGCTGCTGTT TTATCTTTAT TTCCTCTCGC TTTCTCTCAT CTGTGATTCT 8950
TTGTTGTAAG TTGTCTCCTC TTTGCAACAA TTCATTTACA GTACCCTCAT 9000
TGTCTTCACT CATATCTTTA TTGAAGTCTT CCTCTTTCAG ATTCACCCCC 9050
TGCTGAATTT CAGCCTCCAG TGGTTCAAGC AATTTTTGTA TATCTGAGTT 9100
AAACTGCTCC AATTCCTTCA AAGGAATGGA GGCCTTTCCA GTCTTAATTC 9150
TGTGAGAAAT AGCTGCAAAT CGACGGTTGA GCTCAGAGAT TTGGGGCTCT 9200
ACTACTTTCC TGCAGTGGTC ACCGCGGTTT GCCATCAATT TTGCTGCTTG 9250
GTCACGTGTG GAGTCCACCT TTGGGCGCAT GTCATTCATT TCAGCCTTTA 9300
AACGCTTAAG AATGTCTTCC TTTTGTTGTC GTTTCTTCTT TTCAGACTCA 9350
TCTAAAAGTT CATCTGCATG AATGATCCAC TTTGTGATTT GTTCTATGTT 9400
CTGATCAAAG GTTTCCATGT GTTTCTGGTA TTCCAACAAA AGATTTAGCC 9450
ATTCTTCTAC TCTGGAGGTG ACAGCTATCC AGTTACTGTT CAGAAGACTC 9500
AGTTTATCTT CTACCAAGGT TTCTTTCTTG CCCAACACCA TTTTCAAAGA 9550
CTCTCCTAAT TCTGTAACAC TCTTCAAGTG AGCCTTCTGT TTCTCAATCT 9600
CTTTTTGAGT AGCCTTTCCC CAGGCAACTT CAGAATCCAA ATTACTTGGC 9650
ATTCCTTCAA CTGCTGATCT CTTCGTCAAT TCxGTATCTG TTGCTGCCAG 9700
CCATTCTGTT AAGACATTCA TTTCCTTTCT CATCTTACGG GACAACTTCA 9750
AGCATTTCTC CAACTGTTGC TTTCTCTCTG TTACCTTCGC ACCCAACTCA 9800
TTGTAATGCA ATTTCAAAGC TGTTACTCGT TCATCAAGCT CTTTGGGATT 9850
TTCTGTCTGC TTTTTCTGTA CAATTTGACG TCCGGTTTTA ATCACCATTT 9900
CCACTTCAGA CTTGACTTCA CTCAGGCTTT TATACAAGTT CACACAATGA 9950
CTTAGTTGTG ACTGAATTAC TTCCTGTTCA ACACTCTTGG TTTCCAATGC 10000 
AGGCAAATGC ATCTTGACTT CATCTAAAAT CATCTTACTT TCCTCTAGAC 10050 
GTTGTTCAAA ATTGGCTGGT TTTTGGAATA ATCGAAATTT CATGGAGACA 10100 
TCTTGTAATT TTTTCTGTGC AACATCAATT TGTGAAAGAA CCCTTTGGTT 10150 
GGCATCCTTC CCCTGGTTAT GTTTCTTCAT TTCTTCTAAA CTTATCTCAT 10200 
GACTTGTCAA ATCTGATTGG ATTTTCTGGG CTTCCTGAGG CATTTGAGCT 10250 
GCATCCACCT TGTCAGTGAT ATAAGCTGCC AACTGCTTGT CAATGAATTC 10300 
AAGCGACTCC TGAATTAAGT GCAAGGACTT TTCAATTTCC TGGGCAGACT 10350 
GGATACTCTG TTCAAGCAAC TTTTGTTTCC TCACAGCCTC TTCATGTAGT 10400 
TCCCTCCAAC GAGAATTAAA CGTCTCAAGC TCCTCATTGA TCAGTTCATC 10450 
CATGACTCCT CCATCTGTAA GAGTCTGTGC CAATAGACGA ATCTGATTTG 10500 
GGTTCTCCTC TGAATGATGC ATCAGATTTT CAAGAGATTC TAGCACTTCA 10550 
GTGATTTCCT CAGGTCCTGC AGGAACATTT TCCATCGTTT TAAGTTTCAA 10600 
TTCTACTTCA TTGAGCCACT TCTTTGCTTT CTCTAAATAT GACAATAACT 10650 
CATGCCAACA TGCCCAAACT TCTTCCAAAG TTTTGCATTT TCCATTCAGC 10700 
CTGGTGCACA GCCATTGGTA GTTGGTGGTC AGAGTTTCAA GTTCCTTTTT 10750 
TAAGGCCTCT TGTGCTGAGG GTGGAGCGTG AGCTATTACA CTATTTACAG 10800 
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TCTCAGTAAG GAGTTTCACT TTAGTTTCTT TTTGTAGTGC CTCTTCTTTA 10850 

GCTCTCTTCA TTTCTTCAAC AGCAGTCTGT AATTCATCTG GAGTTTTATA 10900 

TTCAAAATCT CTCTCTAGAT ATTCTTCTTC AGCTTGTGTC ATCCACTCAT 10950 

GCATCTCTGA TAGATCTTTT TGGAGGCTTA CWTTTTATC CAAACCTGCC 11000 

TTTAAGGCTT CCTTTCTGGT GTAGACCTGG CGGCATATGT GATCCCACTG 11050 

AGTGTTAAGC TCTCTAAGTT CTGTCTCCAG TCTGGATGCA AACTCAAGTT 11100 

CAGCTTCACT CTTTATCTTC TGCCCACCTT CATTAACACT ATTTAAACTG 11150 

GGCTGAATTG TTTGAATATC ACCAACTAAA AGTCTGCATT GTTTGAGCTG 11200 

TTTTTTCAGG ATTTCAGCAT CCCCCAGGGC AGGCCATTCC TCTTTCAGGA 11250 

AAACATCAAC TTCAGCCATC CATTTCTGTA AGGTTTTTAT GTGATTCTGA 11300 

AATTTTCGAA GTTTATTCAT ATGTTCTTCT AGCTTTTGGC AGCTTTCCAC 11350 

CAACTGGGAG GAAAGTTTCT TCCAGTGCCC CTCAATCTCT TCAAATTCTG 11400 

ACAGATATTT CTGGCATATT TCTGAAGGTG CTTTCTTGGC CATCTCCTTC 11450 

ACAGTGTCAC TCAGATAGTT GAAGCCATTT TGTTGCTCTT TCAAAGAACT 11500 

TTGCAGAGCC TGTAATTTCC CGAGTCTCTC CTCCATTATT TCATATTCAG 11550 

TAACACTAAG ATAAGGTACA GAGAGTTTGC TTTCTGACTG CTGGATCCAC 11600 

GTCCTGATGC TACTCATTGT CTCCTGATAG CGCATTGGTG GTAAAGTGTC 11650 

AAAAATTGTC TGTAGCTCTT TCTCTTTGGC CCTCACACCA TCAAAGATCT 11700 

GGTTAAAATG ATTAGTAAAG GCCACAAAGT CTGCATCCAG AAACATTGGC 11750 

CCCTGTCCCT TTTCTTTCAG TTGTAGACTC TGAATTTTTA ATTGCTCAAT 11800 

TTGAGGCTGA AGAGCTGACA ATCTGTTGAC TTCATCCTTA CAAATTTTTA 11850 

ACTGGCTTTT AATTGCTGTT GGCTCTGATA GGGTGGTAGA CTGGGTTTTC 11900 

AACAAGTTTT CGGCAGTAGT TGTCATCTGT TCCAATTGTT GTAGCTGATT 11950

ATAAAAGGTA ATGATGTTGG TTTGATACTC TAGCCAGTTA ACTCTCTCAC 12000 
TCAGCAATTG GCAGAATTCT GTCCACCGGC TGTTCAGTTG TTCTGAAGCT 12050 
TGTCTGATAC TTTCAGCATT AACACCCTCA TTTGCCATCT GTTCCACCAG 12100 
GGCCTGAGCT GATCTGCTGC CATCTTGCAG TTTTCTGAAC TTCTCTGCTT 12150
TTTCTCGTGC TATGGCATTG ACTTTTTCTT GCAAGTCTGA GATGTTGCCT 12200
TCTTTTCGAT AGACTGCAAA TTCAGAACTC TGTAATACAG CTTCTGAACG 12250
AGTAATCCAA CTGTGAAGTT CAGTTATATC GAOATCCAAC CTTTTCCTGA 12300
GTTCAGAATC CACAGTTATC TGCCTCTTCT TTTGAGGAGG TGGTGGTGGA 12350
AGTTCCTCTT GGGCATGTTT TACCATGATT TGTTCCCTTG TGGTCACCAT 12400
AGTTACCGTT TCCATTACAG TTGTCTGTGT TAGGGATGGT TGAGTGGTGG 12450
TGACAGCCTG TGAAATTTGT GCTGAACTCT TTTCAAGTTT TTGGGTTAAA 12500
TTGTCCCAAC GTTGTGCAAA GTTTTCCATC CAGATTTCCA TCTTTTGAGT 12550
CACTGACTTA TTTTTCAGTG CCGAAAGTAG ATCTTGATTG AGTGAACTTA 12600
GTTTTTCCAT GGTTGGCTTT TTCTTTTCTA GATCTATTTT TAAAGTAGAT 12650
ATTTTGTGAA GACTTGACAT CATTTCATTT TGATCTTTAA AGCCACTTGT 12700
CTGAATGTTC TTCATTGCAT            TGAAAGCCAT GTACTAAAAA 12750
GGCACTGTTC TTCAGTAAAA TGCTGCCATT TTAGAAGAAT ATCTTGTAAA 12800
ACAATCCAGC GGTCTTCAGT CCATCTGCAG ATATTTGCCC ATCGATCTCC 12850
CAGTACCTTA AGTTGTTCTT CCAAAGCAGC TGTTGCATGA TCACCGCTGG 12900
ATTCATCAAC CACTACTACC ATGTGAGTGA GCGAGTTGAC CCTGACCTGC 12950
TCCTGTTCTA GATCTTCTTG AAGCACCTTA TGTTGTTGTA CTTGGCATTT 13000
TAGATCTTCA AGATCAGGTC CAAAGGGCTC TTCCTCCATT TTCTTAGTTC 13050
TCTCTTCAGT TTTTGTTAAC CAGTCATCTA GTTCTTTTAA TTTCTGATTC 13100
TGGAGATCCA TTAGAACTTT GTGTAATTTG CTTTGTTTTT CCATGCTAGC 13150
TACCCTGAGA CATTCCCATC TTGAATTTAG GAGATTCATT TGTTCTTGCA 13200
CTTCAGCTTC TTCATCTTCT GATAATTTCC CTTTTCCAAC TAGTTGACTT 13250
CCTAACTGTA GAACATTACC AACAAGTCCT TGATGAGATG TCAGATCCAT 13300
CATGAATCCC TCATGAGCAT GAAACTGTTC TTTCACTTCT TCAACATCAT 13350
TTGAAATCTC TCCTTGTGCT CGCAATGTAT CCTCGGCAGA AAGAAGCCAT 13400
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GAAAGTACTT CTTCTAAAGC AGTTTGGTAA CTATCCAGAT TTACTTCCGT 13450 

CTCCATCAAT GAACTGTCAA GTGACTTGTC TCTGGGAGCT TCCAAATGCT 13500 

GTGAAGGATA GGGGCTCTGT GTGGAATCAG AGGTGGCAAC ATAAGCAGCC 13550 

TGTGTGAAGG CATAACTCTT GAATCGAGGC TxAGGAGATG AAGAAGTTTG 13600 

TTCATAGCCC TGTGCTAGAC TGACTGTGAT CTGTTGAGAG TAATGCATCT 13650 

GGTGATGTAA TTGAAAATGT TCTTCTCTAG TTACTTTTGA AGATGTCCTG 13700 

GGCAACATTT CCACTTCTTG AATGGCTTCA ATGCTCACTT GTTGTGGCAA 13750 

AACTTGAAAG AGTGATGTGA TGTACATTAA GATGGACTTC TTGTCTGGAT 13800 

AAGTGGTAGC AACATCTTCA GGATCAAGAA GTTTTTCTAT GCCTAACTGG 13850 

CATTTTGCAA TGTTGAAGGC ATGTTCCAGT CTTTGGGTGG CTGAGTGCTG 13900 

TGAAACCACA CTATTCCAAT CAAACAGGTC GGGCCTGTGA CTATGGATAA 13950 

GAGCATTCAA AGCCAACCCG TCGGACCAGC TAGAGGTGAA GTTGATGACG 14000 

TTAACCTGTG GATAATTACG TGTTGACTGT CGAACCCAGC TCAGAAGAAT 14050 

CTTTTCACTG TTGGTTTGCT GCAATCCAGC CATGATAGTT TTCATCACAT 14100 

TTTTGACCTG CCAGTGGAGG ATTATATTCC AAATCAAACC AAGAGTGAGT 14150 

TTATGATTTC CATCCACTAT GTCAGTGCTT CCTATATTCA CTAAATCAAC 14200 

ATTATTTTTC TGTAAGACCC GCAGTGCCTT GTTGACATTG TTCAGGGCAT 14250 

GAACTCTTGT AGATCCCTTT TCTTTTGGCA GTTTTTGCCC TGTAAGGCCT 14300 

TCCAAGAGGT CTAGGAGGCG TTTTCCATCC TGCAGGTCAC TGAAGAGGTT 14350 

GTCTATGTGT TGCTTTCCAA ACTTAGAAAA TTGTGCATTT ATCCATTTTG 14400

TGAATGTTTT CTTTTGAACA TCTTCTCTTT CATAACAGTC CTCTACTTCT 14450 

TCCCACCAAA GCATTTGGAA GAAAAAGTAT ATATCAAGGC AGGGATAAAA 14500 

ATCTTGGTAA AAGTTTCTCC CAGTTTTATT GCTCCAGGAG GCTTAGGTAC 14550 

GATGAGAAGC CAATAAACTT CAGCAGCCTT GACAAAAAAA AAAAAAAAAA 14600 

TAGCACTTCA AGTCTTCCTA TTCGTTTTTT CTATAAAGCT ATTGCCTTCA 14650 

AGAGCGGAAT TCCTGCAGCC CGGGGGATCC ACTAGTTCTA GAGCGGCCGC 14700 
GGGTACAATT CCGCAGCTTT TAGAGCAGAA GTAACACTTC CGTACAGGCC 14750
TAGAAGTAAA GGCAACATCC ACTGAGGAGC AGTTCTTTGA TTTGCACCAC 14800
CACCGGATCC GGGACCTGAA ATAAAAGACA AAAAGACTAA ACTTACCAGT 14850
TAACTTTCTG GTTTTTCAGT TCCTCGAGTA CG-GATCCTC TAGAGTCCGG 14900
AGGCTGGATC GGTCCCGGTG TCTTCTATGG AGGTCAAAAC AGCGTGGATG 14950
GCGTCTCCAG GCGATCTGAC GGTTCACTAA ACGAGCTCTG CTTATATAGA 15000
CCTCCCACCG TACACGCCTA CCGCCCATTT GCGTCAATGG GGCGGAGTTG 15050
TTACGACATT TTGGAAAGTC CCGTTGATTT TGGTGCCAAA ACAAACTCCC 15100
ATTGACGTCA ATGGGGTGGA GACTTGGAAA TCCCCGTGAG TCAAACCGCT 15150
ATCCACGCCC ATTGATGTAC TGCCAAAACC GCATCACCAT GGTAATAGCG 15200
ATGACTAATA CGTAGATGTA CTGCCAAGTA GGAAAGTCCC ATAAGGTCAT 15250
GTACTGGGCA TAATGCCAGG CGGGCCATTT ACCGTCATTG ACGTCAATAG 15300
GGGGCGTACT TGGCATATGA TACACTTGAT GTACTGCCAA GTGGGCAGTT 15350
TACCGTAAAT ACTCCACCCA TTGACGTCAA TGGAAAGTCC CTATTGGCGT 15400
TACTATGGGA ACATACGTCA TTATTGACGT CAATGGGCGG GGGTCGTTGG 15450
GCGGTCAGCC AGGCGGGCCA TTTACCGTAA GTTATGTAAC GACCTGCAGG 15500
TCGACTCTAG AGGATCTCCC TAGACAAATA TTACGCGCTA TGAGTAACAC 15550
AAAATTATTC AGATTTCACT TCCTCTTATT CAGTTTTCCC GCGAAAATGG 15600
CCAAATCTTA CTCGGTTACG CCCAAATTTA CTACAACATC CGCCTAAAAC 15650
CGCGCGAAAA TTGTCACTTC CTGTGTACAC CGGCGCACAC CAAAAACGTC 15700
ACTTTTGCCA CATCCGTCGC TTACATGTGT TCCGCCACAC TTGCAACATC 15750
ACACTTCCGC CACACTACTA CGTCACCCGC CCCGTTCCCA CGCCCCGCGC 15800
CACGTCACAA ACTCCACCCC CTCATTATCA TATTGGCTTC AATCCAAAAT 15850
AAGGTATATT ATTGATGATG CTAGCGGGGC CCTATATATG GATCCAATTG 15900
CAATGATCAT CATGACAGAT CTGCGCGCGA TCGATATCAG CGCTTTAAAT 15950
TTGCGCATGC TAGCTATAGT TCTAGAGGTA CCGGTTGTTA ACGTTAGCCG 16000
GCTACGTATA CTCCGGAATA TTAATAGGCC TAGGATGCAT ATGGCGGCCG 16050
GCCGCCTGCA GCTGGCGCCA TCGATACGCG TACGTCGCGA CCGCGGACAT 16100 
GTACAGAGCT CGAGAAGTAC TAGTGGCCAC GTGGGCCGTG CACCTTAAGC 16150 
TTGGCACTGG CCGTCGTTTT ACAACGTCGT GACTGGGAAA ACCCTGGCGT 16200
TACCCAACTT AATCGCCTTG CAGCACATCC CCCTTTCGCC AGCTGGCGTA 16250 
ATAGCGAAGA GGCCCGCACC GATCGCCCTT CCCAACAGTT GCGCAGCCTG 16300 
AATGGCGAAT GGCGCCTGAT GCGGTATTTT CTCCTTACGC ATCTGTGCGG 16350 
TATTTCACAC CGCATACGTC AAAGCAACCA TAGTACGCGC CCTGTAGCGG 16400
CGCATTAAGC GCGGCGGGTG TGGTGGTTAC GCGCAGCCTG ACCGCTACAC 16450 
TTGCCAGCGC CCTAGCGCCC GCTCCTTTCG CTTTCTTCCC TTCCTTTCTC 16500 
GCCACGTTCG CCGGCTTTCC CCGTCAAGCT CTAAATCGGG GGCTCCCTTT 16550 
AGGGTTCCGA TTTAGTGCTT TACGGCACCT CGACCCCAAA AAACTTGATT 16600 
TGGGTGATGG TTCACGTAGT GGGCCATCGC CCTGATAGAC GGTTTTTCGC 16650 
CCTTTGACGT TGGAGTCCAC GTTCTTTAAT AGTGGACTCT TGTTCCAAAC 16700 
TGGAACAACA CTCAACCCTA TCTCGGGCTA TTCTTTTGAT TTATAAGGGA 16750 
TTTTGCCGAT TTCGGCCTAT TGGTTAAAAA ATGAGCTGAT TTAACAAAAA 16800 
TTTAACGCGA ATTTTAACAA AATATTAACG TTTACAATTT TATGGTGCAC 16850 
TCTCAGTACA ATCTGCTCTG ATCCCCCATA GTTAAGCCAG CCCCGACACC 16900 
CGCCAACACC CGCTGACGCG CCCTGACGGG CTTGTCTGCT CCCGGCATCC 16950 
GCTTACAGAC AAGCTGTGAC CGTCTCCGGG AGCTGCATGT GTCAGAGGTT 17000 
TTCACCGTCA TCACCGAAAC GCGCGAGACG AAAGGGCCTC GTGATACGCC 17050 
TATTTTTATA GGTTAATGTC ATGATAATAA TGGTTTCTTA GACGTCAGGT 17100 
GGCACTTTTC GGGGAAATGT GCGCGGAACC CCTATTTGTT TATTTTTCTA 17150 
AATACATTCA AATATGTATC CGCTCATGAG ACAATAACCC TGATAAATGC 17200 
TTCAATAATA TTGAAAAAGG AAGAGTATGA GTATTCAACA TTTCCGTGTC 17250 
GCCCTTATTC CCTTTTTTGC GGCATTTTGC CTTCCTGTTT TTGCTCACCC 17300 
AGAAACGCTG GTGAAAGTAA AAGATGCTGA AGATGAGTTG GGTGCACGAG 17350
TGGGTTACAT CGAACTGGAT CTCAACAGCG GTAAGATCCT TGAGAGTTTT 17400
CGCCCCGAAG AACGTTTTCC AATGATGAGC ACTTTTAAAG TTCTGCTATG 17450
TGGCGCGGTA TTATCCCGTA TTGACGCCGG GCAAGAGCAA CTCGGTCGCC 17500
GCATACACTA TTCTCAGAAT GACTTGGTTG AGTACTCACC AGTCACAGAA 17550
AAGCATCTTA CGGATGGCAT GACAGTAAGA GAATTATGCA GTGCTGCCAT 17600
AACCATGAGT GATAACACTG CGGCCAACTT ACTTCTGACA ACGATCGGAG 17650
GACCGAAGGA GCTAACCGCT TTTTTGCACA ACATGGGGGA TCATGTAACT 17700
CGCCTTGATC GTTGGGAACC GGAGCTGAAT GAAGCCATAC CAAACGACGA 17750
GCGTGACACC ACGATGCCTG TAGCAATGGC AACAACGTTG CGCAAACTAT 17800
TAACTGGCGA ACTACTTACT CTAGCTTCCC GGCAACAATT AATAGACTGG 17850
ATGGAGGCGG ATAAAGTTGC AGGACGACTT CTGCGCTCGG CCCTTCCGGC 17900
TGGCTGGTTT ATTGCTGATA AATCTGGAGC CGGTGAGCGT GGGTCTCGCG 17950
GTATCATTGC AGCACTGGGG CCAGATGGTA AGCCCTCCCG TATCGTAGTT 18000
ATCTACACGA CGGGGAGTCA GGCAACTATG GATGAACGAA ATAGACAGAT 18050
CGCTGAGATA GGTGCCTCAC TGATTAAGCA TTGGTAACTG TCAGACCAAG 18100
TTTACTCATA TATACTTTAG ATTGATTTAA AACTTCATTT TTAATTTAAA 18150
AGGATCTAGG TGAAGATCCT TTTTGATAAT CTCATGACCA AAATCCCTTA 18200
ACGTGAGTTT TCGTTCCACT GAGCGTCAGA CCCCGTAGAA AAGATCAAAG 18250
GATCTTCTTG AGATCCTTTT TTTCTGCGCG TAATCTGCTG CTTGCAAACA 18300
AAAAAACCAC CGCTACCAGC GGTGGTTTGT TTGCCGGATC AAGAGCTACC 18350
AACTCTTTTT CCGAAGGTAA CTGGCTTCAG CAGAGCGCAG ATACCAAATA 18400
CTGTTCTTCT AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA 18450
GCACCGCCTA CATACCTCGC TCTGCTAATC CTGTTACCAG TGGCTGCTGC 18500
CAGTGGCGAT AAGTCGTGTC TTACCGGGTT GGACTCAAGA CGATAGTTAC 18550
CGGATAAGGC GCAGCGGTCG GGCTGAACGG GGGGTTCGTG CACACAGCCC 18600
AGCTTGGAGC GAACGACCTA CACCGAACTG AGATACCTAC AGCGTGAGCT 18650
ATGAGAAAGC GCCACGCTTC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG 18700
TAAGCGGCAG GGTCGGAACA GGAGAGCGCA CGAGGGAGCT TCCAGGGGGA 18750
AACGCCTGGT ATCTTTATAG TCCTGTCGGG TTTCGCCACC TCTGACTTGA 18800
GCGTCGATTT TTGTGATGCT CGTCAGGGGG GCGGAGCCTA TGGAAAAACG 18850
CCAGCAACGC GGCCTTTTTA CGGTTCCTGG CCTTTTGCTG GCCTTTTGCT 18900
CACATGTTCT TTCCTGCGTT ATCCCCTGAT TCTGTGGATA ACCGTATTAC 18950
CGCCTTTGAG TGAGCTGATA CCGCTCGCCG CAGCCGAACG ACCGAGCGCA 19000
GCGAGTCAGT GAGCGAGGAA GCGGAAGAGC GCCCAATACG CAAACCGCCT 19050
CTCCCCGCGC GTTGGCCGAT TCATTAATGC AGCTGGCACG ACAGGTTTCC 19100
CGACTGGAAA GCGGGCAGTG AGCGCAACGC AATTAATGTG AGTTAGCTCA 19150
CTCATTAGGC ACCCCAGGCT TTACACTTTA TGCTTCCGGC TCGTATGTTG 19200
TGTGGAATTG TGAGCGGATA ACAATTTCAC ACAGGAAACA GCTATGACCA 19250
TGATTACGAA TTCGAATGGC CATGGGACGT CGACCTGAGG TAATTATAAC 19300
CCGGGCC 19307


WO 96/13597 


PCT/US95/14017



WHAT IS CLAIMED IS: 

1. A recombinant shuttle vector comprising: 

(a) the DNA sequences of, or corresponding to,
a portion of the genome of an adenovirus which comprises
DNA sequences of, or corresponding to, the adenovirus 5'
and 3' inverted terminal repeats and packaging/enhancer
domain necessary for replication and virion encapsidation
in the absence of sequence encoding viral genes;

(b) a selected gene operatively linked to 
regulatory sequences directing its expression, said gene 
operatively linked to the DNA of (a) and capable of 
expression in a target cell in vivo or in vitro. 

2. The vector according to claim 1 wherein said 
DNA sequences (a) comprise the native adenovirus 5'
inverted terminal repeats and packaging sequences. 

3. The vector according to claim 1 wherein said 
DNA sequences (a) comprise the native adenovirus 3' 
inverted terminal repeat sequences. 

4. The vector according to claim 1 wherein said
selected gene (b) is a reporter gene. 

5. The vector according to claim 4 wherein said 
reporter gene is selected from the group consisting of 
the genes encoding β-galactosidase, alkaline phosphatase
and green fluorescent protein. 

6. The vector according to claim 1 wherein said 
selected gene (b) is a therapeutic gene. 

7. The vector according to claim 6 wherein said
therapeutic gene is selected from the group consisting of 
a normal CFTR gene, a DMD Becker allele and a normal LDL 
gene. 

8. A crippled adenovirus helper virus comprising a 
modified adenovirus sequence in place of native 
adenovirus sequence map units 0-1, which modification 
reduces the packaging efficiency of said virus, said 
virus also containing selected adenovirus genes necessary 
to direct a productive viral infection. 

9. The helper virus according to claim 8 wherein 
said modified sequence comprises: 

i. a fragment of adenovirus map units 0-1; 

ii. a fragment of (i) containing a 3' inverted
terminal repeat and between one to four selected
packaging sequences,

iii. a modified fragment of (i) containing at 
least one PAC consensus sequence in place of at least one 
native PAC sequence; and 

iv. a modified fragment of (ii), wherein said
native PAC sequences are mutated to contain modified 
sequences. 

10. The virus according to claim 8 wherein said 
modified sequence comprises Ad5 base pairs 1-269. 

11. The virus according to claim 8 wherein said 
sequence (ii) comprises Ad5 base pairs 1-321. 

12. The virus according to claim 8 wherein said 
helper adenovirus is conjugated to a poly-cation
sequence. 

13. A method for producing a recombinant adenovirus
which comprises transfecting a selected host cell with 

(a) a recombinant shuttle vector comprising 

i. the DNA sequences of, or
corresponding to, a portion of the genome of an
adenovirus which comprises adenovirus 5' and 3'
cis-elements necessary for replication and virion
encapsidation in the absence of sequence encoding viral
genes; and

ii. a selected gene operatively linked to
regulatory sequences directing its expression, said gene
linked to the DNA of (a) and capable of expression in a
target cell in vivo or in vitro; and

(b) a helper adenovirus comprising sufficient 
adenovirus gene sequences necessary for a productive 
viral infection, wherein said transfected host cell 
permits the formation of a recombinant virus comprising 
the DNA of (i) and (ii) in an adenoviral capsid, and 

isolating and purifying the recombinant virus from 
said cell. 

14. The method according to claim 13, wherein said
helper virus is a crippled helper virus comprising a 
modified adenovirus sequence in place of native 
adenovirus sequence map units 0-1, which modification 
reduces the packaging efficiency of said helper virus, 
said helper virus also containing selected adenovirus 
genes necessary to direct a productive viral infection. 

15. The method according to claim 13 wherein said
helper adenovirus is associated with a poly-cation 
sequence. 

16. The method according to claim 13 wherein said
vector is associated with said helper adenovirus 
conjugate in a single particle. 

17. The method according to claim 13 wherein said
helper virus is an adenovirus sequence containing 
deletions of all or portions of the E1a and E1b genes.

18. The method according to claim 13 wherein said 
helper virus is an adenovirus sequence containing 
deletions of all or a portion of the E3 gene. 

19. A recombinant adenovirus comprising 

i. the DNA of, or corresponding to, a 
portion of the genome of an adenovirus which comprises 
adenovirus 5' and 3' cis-elements necessary for
replication and virion encapsidation in the absence of 
sequence encoding viral genes; 

ii. a selected gene operatively linked to 
regulatory sequences directing its expression, said gene 
linked to the DNA of (a) and capable of expression in a
target cell in vivo or in vitro; 

said DNA and gene encapsidated in an adenoviral
capsid. 

20. The virus according to claim 19 wherein said 
viral capsid is a capsid of an adenovirus serotype 
selected from the group consisting of types 2, 4, 5, 7, 
12 and 40. 

21. The virus according to claim 19 wherein said 
selected gene is a CFTR gene, a DMD gene and an LDL gene. 

22. The use of a recombinant adenovirus according
to claim 19 for the manufacture of a pharmaceutical 
composition suitable for delivering and integrating a 
selected gene into the chromosome of a target cell. 

FIGURE 3A


GAACTCGAGC AGCTGAAGCT TGAATTCCAT CATCAATAAT ATACCTTATT 50
TTGGATTGAA GCCAATATGA TAATGAGGGG GTGGAGTTTG TGACGTGGCG 100
CGGGGCGTGG GAACGGGGCG GGTGACGTAG GTTTTAGGGC GGAGTAACTT 150
GTATGTGTTG GGAATTGTAG TTTTCTTAAA ATGGGAAGTT ACGTAACGTG 200
GGAAAACGGA AGTGACGATT TGAGGAAGTT GTGGGTTTTT TGGCTTTCGT 250
TTCTGGGCGT AGGTTCGCGT GCGGTTTTCT GGGTGTTTTT TGTGGACTTT 300
AACCGTTACG TCATTTTTTA GTCCTATATA TACTCGCTCT GCACTTGGCC 350
CTTTTTTACA CTGTGACTGA [illegible] GTGGTGTTTT 400
TTTAATAGGT TTTCTTTTTT ACTGGTAAGG CTGACTGTTA GGCTGCCGCT 450
GTGAAGCGCT GTATGTTGTT CTGGAGCGGG AGGGTGCTAT TTTGCCTAGG 500
CAGGAGGGTT TTTCAGGTGT TTATGTGTTT TTCTCTCCTA TTAATTTTGT 550
TATACCTCCT ATGGGGGCTG [illegible] CGGGTATGTA 600
TTCCCCCCAA GCTTGCATGC [illegible] ATCCGAAAAA 650
ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA TGCAATTGTT 700
GTTGTTAACT TGTTTATTGC AGCTTATAAT GGTTACAAAT AAAGCAATAG 750
CATCACAAAT TTCACAAATA [illegible] TCTAGTTGTG 800
GTTTGTCCAA ACTCATCAAT [illegible] CCCCGCGGCC 850
GCCTAGAGTC GAGGCCGAGT [illegible] CAGCGGTTGG 900
[illegible] CCGTATCACT 950
TTTGCTGATA TGGTTGATGT [illegible] AACGGGAAGT 1000
AGGCTCCCAT GATAAAAAAG TAAAAGAAAA AGAATAAACC GAACATCCAA 1050
AAGTTTGTGT TTTTTAAATA GTACATAATG GATTTCCTTA CGCGAAATAC 1100
GGGCAGACAT GGCCTGCCCG GTTATTATTA TTTTTGACAC CAGACCAACT 1150
GGTAATGGTA GCGACCGGCG CTCAGCTGTA ATTCCGCCGA TACTGACGGG 1200
CTCCAGGAGT CGTCGCCACC AATCCCCATA TGGAAACCGT CGATATTCAG 1250
CCATGTGCCT TCTTCCGCGT GCAGCAGATG GCGATGGCTG CTTTCCATCA 1300
GTTGCTGTTG ACTGTAGCGG CTGATGTTGA ACTGGAAGTC GCCGCGCCAC 1350


FIGURE 3B 

TGGTGTGGGC CATAATTCAA TTCGCGCGTC CCGCAGCGCA GACCGTTTTC 1400 

GCTCGGGAAG ACGTACGGGG TATACATGTC TGACAATGGC AGATCCCAGC 1450 

GGTCAAAACA GGCGGCAGTA AGGCGGTCGG GATAGTTTTC TTGCGGCCCT 1500 

AATCCGAGCC AGTTTACCCG CTCTGCTACC TGCGCCAGCT GGCAGTTCAG 1550 

GCCAATCCGC GCCGGATGCG GTGTATCGCT CGCCACTTCA ACATCAACGG 1600 

TAATCGCCAT TTGACCACTA CCATCAATCC GGTAGGTTTT CCGGCTGATA 1650 

AATAAGGTTT TCCCCTGATG CTGCCACGCG TGAGCGGTCG TAATCAGCAC 1700 

CGCATCAGCA AGTGTATCTG CCGTGCACTG CAACAACGCT GCTTCGGCCT 1750 

GGTAATGGCC CGCCGCCTTC CAGCGTTCGA CCCAGGCGTT AGGGTCAATG 1800 

CGGGTCGCTT CACTTACGCC AATGTCGTTA TCCAGCGGTG CACGGGTGAA 1850 

CTGATCGCGC AGCGGCGTCA GCAGTTGTTT TTTATCGCCA ATCCACATCT 1900 

GTGAAAGAAA GCCTGACTGG CGGTTAAATT GCCAACGCTT ATTACCCAGC 1950 

TCGATGGAAA AATCCATTTC GCTGGTGGTC AGATGCGGGA TGGCGTGGGA 2000 

CGCGGCGGGG AGCGTCACAC TGAGGTTTTC CGCCAGACGC CACTGCTGCC 2050 

AGGCGCTGAT GTGCCCGGCT TCTGACCATG CGGTCGCGTT CGGTTGCACT 2100 

ACGCGTACTG TGAGCCAGAG TTGCCCGGCG CTCTCCGGCT GCGGTAGTTC 2150 

AGGCAGTTCA ATCAACTGTT TACCTTGTGG AGCGACATCC AGAGGCACTT 2200 

CACCGCTTGC CAGCGGCTTA CCATCCAGCG CCACCATCCA GTGCAGGAGC 2250 

TCGTTATCGC TATGACGGAA CAGGTATTCG CTGGTCACTT CGATGGTTTG 2300 

CCCGGATAAA CGGAACTGGA AAAACTGCTG CTGGTGTTTT GCTTCCGTCA 2350

GCGCTGGATG CGGCGTGCGG TCGGCAAAGA CCAGACCGTT CATACAGAAC 2400

TGGCGATCGT TCGGCGTATC GCCAAAATCA CCGCCGTAAG CCGACCACGG 2450 

GTTGCCGTTT TCATCATATT TAATCAGCGA CTGATCGACC CAGTCCCAGA 2500 

CGAAGCCGCC CTGTAAACGG GGATACTGAC GAAACGCCTG CCAGTATTTA 2550 

GCGAAACCGC CAAGACTGTT ACCCATCGCG TGGGCGTATT CGCAAAGGAT 2600

CAGCGGGCGC GTCTCTCCAG GTAGCGAAAG CCATTTTTTG ATGGACCATT 2650 

FIGURE 3C

TCGGCACAGC CGGGAAGGGC TGGTCTTCAT CCACGCGCGC GTACATCGGG 2700 

CAAATAATAT CGGTGGCCGT GGTGTCGGCT CCGCCGCCTT CATACTGCAC 2750 

CGGGCGGGAA GGATCGACAG ATTTGATCCA GCGATACAGC GCGTCGTGAT 2800 

TAGCGCCGTG GCCTGATTCA TTCCCCAGCG ACCAGATGAT CACACTCGGG 2850 

TGATTACGAT CGCGCTGCAC CATTCGCGTT ACGCGTTCGC TCATCGCCGG 2900 

TAGCCAGCGC GGATCATCGG TCAGACGATT CATTGGCACC ATGCCGTGGG 2950 

TTTCAATATT GGCTTCATCC ACCACATACA GGCCGTAGCG GTCGCACAGC 3000 

GTGTACCACA GCGGATGGTT CGGATAATGC GAACAGCGCA CGGCGTTAAA 3050 

GTTGTTCTGC TTCATCAGCA GGATATCCTG CACCATCGTC TGCTCATCCA 3100 

TGACCTGACC ATGCAGAGGA TGATGCTCGT GACGGTTAAC GCCTCGAATC 3150 

AGCAACGGCT TGCCGTTCAG CAGCAGCAGA CCATTTTCAA TCCGCACCTC 3200 

GCGGAAACCG ACATCGCAGG CTTCTGCTTC AATCAGCGTG CCGTCGGCGG 3250 

TGTGCAGTTC AACCACCGCA CGATAGAGAT TCGGGATTTC GGCGCTCCAC 3300 

AGTTTCGGGT TTTCGACGTT CAGACGTAGT GTGACGCGAT CGGCATAACC 3350 

ACCACGCTCA TCGATAATTT CACCGCCGAA AGGCGCGGTG CCGCTGGCGA 3400 

CCTGCGTTTC ACCCTGCCAT AAAGAAACTG TTACCCGTAG GTAGTCACGC 3450 

AACTCGCCGC ACATCTGAAC TTCAGCCTCC AGTACAGCGC GGCTGAAATC 3500 

ATCATTAAAG CGAGTGGCAA CATGGAAATC GCTGATTTGT GTAGTCGGTT 3550 

TATGCAGCAA CGAGACGTCA CGGAAAATGC CGCTCATCCG CCACATATCC 3600 

TGATCTTCCA GATAACTGCC GTCACTCCAA CGCAGCACCA TCACCGCGAG 3650 

GCGGTTTTCT CCGGCGCGTA AAAATGCGCT CAGGTCAAAT TCAGACGGCA 3700 

AACGACTGTC CTGGCCGTAA CCGACCCAGC GCCCGTTGCA CCACAGATGA 3750 

AACGCCGAGT TAACGCCATC AAAAATAATT CGCGTCTGGC CTTCCTGTAG 3800 

CCAGCTTTCA TCAACATTAA ATGTGAGCGA GTAACAACCC GTCGGATTCT 3850 

CCGTGGGAAC AAACGGCGGA TTGACCGTAA TGGGATAGGT TACGTTGGTG 3900 

TAGATGGGCG CATCGTAACC GTGCATCTGC CAGTTTGAGG GGACGACGAC 3950

FIGURE 3D

AGTATCGGCC TCAGGAAGAT CGCACTCCAG CCAGCTTTCC GGCACCGCTT 4000
CTGGTGCCGG AAACCAGGCA AAGCGCCATT CGCCATTCAG GCTGCGCAAC 4050
TGTTGGGAAG GGCGATCGGT GCGGGCCTCT TCGCTATTAC GCCAGCTGGC 4100
CAAAGGGGGA TGTGCTGCAA GGCGATTAAG TTGGGTAACG CCAGGGTTTT 4150
CCCAGTCACG ACGTTGTAAA ACGACGGGAT CGCGCTTGAG CAGCTCCTTG 4200
CTGGTGTCCA GACCAATGCC TCCCAGACCG GCAACGAAAA TCACGTTCTT 4250
GTTGGTCAAA GTAAACGACA TGGTGACTTC TTTTTTGCTT TAGCAGGCTC 4300
TTTCGATCCC CGGGAATTGC GGCCGCGGGT ACAATTCCGC AGCTTTTAGA 4350
GCAGAAGTAA CACTTCCGTA CAGGCCTAGA AGTAAAGGCA ACATCCACTG 4400
AGGAGCAGTT CTTTGATTTG CACCACCACC GGATCCGGGA CCTGAAATAA 4450
AAGACAAAAA GACTAAACTT ACCAGTTAAC TTTCTGGTTT TTCAGTTCCT 4500
CGAGTACCGG ATCCTCTAGA GTCCGGAGGC TGGATCGGTC CCGGTCTCTT 4550
CTATGGAGGT CAAAACAGCG TGGATGGCGT CTCCAGGCGA TCTGACGGTT 4600
CACTAAACGA GCTCTGCTTA TATAGACCTC CCACCGTACA CGCCTACCGC 4650
CCATTTGCGT CAATGGGGCG GAGTTGTTAC GACATTTTGG AAAGTCCCGT 4700
TGATTTTGGT GCCAAAACAA ACTCCCATTG ACGTCAATGG GGTGGAGACT 4750
TGGAAATCCC CGTGAGTCAA ACCGCTATCC ACGCCCATTG ATGTACTGCC 4800
AAAACCGCAT CACCATGGTA ATAGCGATGA CTAATACGTA GATGTACTGC 4850
CAAGTAGGAA AGTCCCATAA GGTCATGTAC TGGGCATAAT GCCAGGCGCG 4900
CCATTTACCG TCATTGACGT CAATAGGGGG CGTACTTGGC ATATGATACA 4950
CTTGATGTAC TGCCAAGTGG GCAGTTTACC GTAAATACTC CACCCATTGA 5000
CGTCAATGGA AAGTCCCTAT TGGCGTTACT ATGGGAACAT ACGTCATTAT 5050
TGACGTCAAT GGGCGGGGGT CGTTGGGCGG TCAGCCAGGC GGGCCATTTA 5100
CCGTAAGTTA TGTAACGACC TGCAGGTCGA CTCTAGACGA TCTCCCTAGA 5150
CAAATATTAC GCGCTATGAG TAACACAAAA TTATTCAGAT TTCACTTCCT 5200
CTTATTCAGT TTTCCCGCGA AAATGGCCAA ATCTTACTCG GTTACGCCCA 5250

FIGURE 3E

AATTTACTAC AACATCCGCC TAAAACCGCG CGAAAATTGT CACTTCCTGT 5300 
GTACACCGGC GCACACCAAA AACGTCACTT TTGCCACATC CGTCGCTTAC 5350 
ATGTGTTCCG CCACACTTGC AACATCACAC TTCCGCCACA CTACTACGTC 5400 
ACCCGCCCCG TTCCCACGCC CCGCGCCACG TCACAAACTC CACCCCCTCA 5450
TTATCATATT GGCTTCAATC CAAAATAAGG TATATTATTG ATGATGCTAG 5500
CGAATTCATC GATGATATCA GATCTGCCGG TCTCCCTATA GTGAGTCGTA 5550
TTAATTTCGA TAAGCCAGGT TAACCTGCAT TAATGAATCG GCCAACGCGC 5600
GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC TCGCTCACTG 5650 
ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA 5700 
AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG CAGGAAAGAA 5750 
CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT 5800 
TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT 5850 
CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA 5900 
GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT CCGACCCTGC 5950 
CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT 6000
TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG TCGTTCGCTC 6050
CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT 6100
TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG 6150 
CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG 6200 
CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC TACACTAGAA 6250
GGACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA 6300
AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG CTAGCGGTGG 6350
TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG 6400
AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC 6450
TCACGTTAAG GGATTTTGGT CATGAGATTA TCAAAAAGGA TCTTCACCTA 6500
GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA AGTATATATG 6550

FIGURE 3F

AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC 6600 
TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT 6650 
GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 6700 
TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAC 6750
CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC 6800 
CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC 6850 
CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG 6900
TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC 6950 
AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 7000 
TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC 7050 
ATGGTTATGG CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG 7100 
ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC TGAGAATAGT 7150 
GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC 7200 
GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC 7250 
GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT 7300 
AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC 7350 
GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT 7400 
AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT TTTCAATATT 7450 
ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA 7500 
TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA 7550 
AGTGCCACCT GACGTCTAAG AAACCATTAT TATCATGACA TTAACCTATA 7600 
AAAATAGGCG TATCACGAGG CCCTTTCGTC TCGCGCGTTT CGGTGATGAC 7650 
GGTGAAAACC TCTGACACAT GCAGCTCCCG GAGACGGTCA CAGCTTGTCT 7700 
GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG TCAGCGGGTG 7750 
TTGGCGGGTC TCGGGGCTGG CTTAACTATG CGGCATCAGA GCAGATTGTA 7800 
CTGAGAGTGC ACCATATGGA CATATTGTCG TTAGAACGCG GCTACAATTA 7850 
ATACATAACC TTATGTATCA TACACATACG ATTTAGGTGA CACTATA 7897 

FIGURE 5A

GAATTCGCTA GCTAGCGGGG GAATACATAC CCGCAGGCGT AGAGACAACA 50
TTACAGCCCC CATAGGAGGT ATAACAAAAT TAATAGGAGA GAAAAACACA 100
TAAACACCTG AAAAACCCTC CTGCCTAGGC AAAATAGCAC CCTCCCGCTC 150
CAGAACAACA TACAGCGCTT CACAGCGGCA GCCTAACAGT CAGCCTTACC 200
AGTAAAAAAG AAAACCTATT AAAAAAACAC CACTCGACAC GGCACCAGCT 250
CAATCAGTCA CAGTGTAAAA AAGGGCCAAG TGCAGAGCGA GTATATATAG 300
GACTAAAAAA TGACGTAACG GTTAAAGTCC ACAAAAAACA CCCAGAAAAC 350
CGCACGCGAA CCTACGCCCA GAAACGAAAG CCAAAAAACC CACAACTTCC 400
TCAAATCGTC ACTTCCGTTT TCCCACGTTA CGTAACTTCC CATTTTAAGA 450
AAACTACAAT TCCCAACACA TACAAGTTAC TCCGCCCTAA AACCTACGTC 500
ACCCGCCCCG TTCCCACGCC CCGCGCCACG TCACAAACTC CACCCCCTCA 550
TTATCATATT GGCTTCAATC CAAAATAAGG TATATTATTG ATGATGCTAG 600
CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 650
GGGGTGGAGT TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 700
TAGTAGTGTG GCGGAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 750
GCGACGGATG TGGCAAAAGT GACGTTTTTG GTGTGCGCCG GTGTACACAG 800
GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 850
CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 900
AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 950
GGGAGATCAG CCTGCAGGTC GTTACATAAC TTACGGTAAA TGGCCCGCCT 1000
GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT 1050
TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT 1100
ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 1150
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA 1200
TGCCCAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG 1250
TATTAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA GTACATCAAT 1300

FIGURE 5B

GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC TCCACCCCAT 1350 

TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA 1400 

AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 1450 

CGGTGGGAGG TCTATATAAG CAGAGCTCGT TTAGTGAACC GTCAGATCGC 1500 

CTGGAGACGC CATCCACGCT GTTTTGACCT CCATAGAAGA CACCGGGACC 1550 

GATCCAGCCT CCGGACTCTA GAGGATCCGG TACTCGAGGA ACTGAAAAAC 1600 

CAGAAAGTTA ACTGGTAAGT TTAGTCTTTT TGTCTTTTAT TTCAGGTCCC 1650 

GGATCCGGTG GTGGTGCAAA TCAAAGAACT GCTCCTCAGT GGATGTTGCC 1700 

TTTACTTCTA GGCCTGTACG GAAGTGTTAC TTCTGCTCTA AAAGCTGCGG 1750 

AATTGTACCC GCGGCCGCAA TTCCCGGGGA TCGAAAGAGC CTGCTAAAGC 1800 

AAAAAAGAAG TCACCATGTC GTTTACTTTG ACCAACAAGA ACGTGATTTT 1850 

CGTTGCCGGT CTGGGAGGCA TTGGTCTGGA CACCAGCAAG GAGCTGCTCA 1900 

AGCGCGATCC CGTCGTTTTA CAACGTCGTG ACTGGGAAAA CCCTGGCGTT 1950 

ACCCAACTTA ATCGCCTTGC AGCACATCCC CCTTTCGCCA GCTGGCGTAA 2000 

TAGCGAAGAG GCCCGCACCG ATCGCCCTTC CCAACAGTTG CGCAGCCTGA 2050 

ATGGCGAATG GCGCTTTGCC TGGTTTCCGG CACCAGAAGC GGTGCCGGAA 2100 

AGCTGGCTGG AGTGCGATCT TCCTGAGGCC GATACTGTCG TCGTCCCCTC 2150 

AAACTGGCAG ATGCACGGTT ACGATGCGCC CATCTACACC AACGTAACCT 2200 

ATCCCATTAC GGTCAATCCG CCGTTTGTTC CCACGGAGAA TCCGACGGGT 2250 

TGTTACTCGC TCACATTTAA TGTTGATGAA AGCTGGCTAC AGGAAGGCCA 2300 

GACGCGAATT ATTTTTGATG GCGTTAACTC GGCGTTTCAT CTCTGGTGCA 2350 

ACGGGCGCTG GGTCGGTTAC GGCCAGGACA GTCGTTTGCC GTCTGAATTT 2400 

GACCTGAGCG CATTTTTACG CGCCGGAGAA AACCGCCTCG CGGTGATGGT 2450 

GCTGCGTTGG AGTGACGGCA GTTATCTGGA AGATCAGGAT ATGTGGCGGA 2500 

TGAGCGGCAT TTTCCGTGAC GTCTCGTTGC TGCATAAACC GACTACACAA 2550 

ATCAGCGATT TCCATGTTGC CACTCGCTTT AATGATGATT TCAGCCGCGC 2600 

FIGURE 5C


TGTACTGGAG GCTGAAGTTC AGATGTGCGG CGAGTTGCGT GACTACCTAC 2650
GGCTAACAGT TTCTTTATGG CAGGGTGAAA CGCAGGTCGC CAGCGGCACC 2700
GCGCCTTTCG GCGGTGAAAT TATCGATGAG CGTGGTGGTT ATGCCGATCG 2750
CGTCACACTA CGTCTGAACG TCGAAAACCC GAAACTGTGG AGCGCCGAAA 2800
TCCCGAATCT CTATCGTGCG GTGGTTGAAC TGCACACCGC CGACGGCACG 2850
CTGATTGAAG CAGAAGCCTG CGATGTCGGT TTCCGCGAGG TGCGGATTGA 2900
AAATGGTCTG CTGCTGCTGA ACGGCAAGCC GTTGCTGATT CGAGGCGTTA 2950
ACCGTCACGA GCATCATCCT CTGCATGGTC AGGTCATGGA TGAGCAGACC 3000
ATGGTGCAGG ATATCCTGCT GATGAAGCAG AACAACTTTA ACGCCGTGCG 3050
CTGTTCGCAT TATCCGAACC ATCCGCTGTG GTACACGCTG TGCGACCGCT 3100
ACGGCCTGTA TGTGGTGGAT GAAGCCAATA TTGAAACCCA CGGCATGGTG 3150
CCAATGAATC GTCTGACCGA TGATCCGCGC TGGCTACCGG CGATGAGCGA 3200
ACGCGTAACG CGAATGGTGC AGCGCGATCG TAATCACCCG AGTGTGATCA 3250
TCTGCTCGCT GGGGAATGAA TCAGGCCACG GCGCTAATCA CGACGCGCTG 3300
TATCGCTGGA TCAAATCTGT CGATCCTTCC CGCCCGGTGC AGTATGAAGG 3350
CGGCGGAGCC GACACCACGG CCACCGATAT TATTTGCCCG ATGTACGCGC 3400
GCGTGGATGA AGACCAGCCC TTCCCGGCTG TGCCGAAATG GTCCATCAAA 3450
AAATGGCTTT CGCTACCTGG AGAGACGCGC CCGCTGATCC TTTGCGAATA 3500
CGCCCACGCG ATGGGTAACA GTCTTGGCGG TTTCGCTAAA TACTGGCAGG 3550
CGTTTCGTCA GTATCCCCGT TTACAGGGCG GCTTCGTCTG GGACTGGGTG 3600
GATCAGTCGC TGATTAAATA TGATGAAAAC GGCAACCCGT GGTCGGCTTA 3650
CGGCGGTGAT TTTGGCGATA CGCCGAACGA TCGCCAGTTC TGTATGAACG 3700
GTCTGGTCTT TGCCGACCGC ACGCCGCATC CAGCGCTGAC GGAAGCAAAA 3750
CACCAGCAGC AGTTTTTCCA GTTCCGTTTA TCCGGGCAAA CCATCGAAGT 3800
GACCAGCGAA TACCTGTTCC GTCATAGCGA TAACGAGCTC CTGCACTGGA 3850
TGGTGGCGCT GGATGGTAAG CCGCTGGCAA GCGGTGAAGT GCCTCTGGAT 3900

FIGURE 5D

GTCGCTCCAC AAGGTAAACA GTTGATTCAA CTGCCTGAAC TACCGCAGCC 3950 

GGAGAGCGCC GGGCAACTCT GGCTCACAGT ACGCGTAGTG CAACCGAACG 4000 

CGACCGCATG GTCAGAAGCC GGGCACATCA GCGCCTGGCA GCAGTGGCGT 4050 

CTGGCGGAAA ACCTCAGTGT GACGCTCCCC GCCGCGTCCC ACGCCATCCC 4100 

GCATCTGACC ACCAGCGAAA TGGATTTTTG CATCGAGCTG GGTAATAAGC 4150 

GTTGGCAATT TAACCGCCAG TCAGGCTTTC TTTCACAGAT GTGGATTGGC 4200 

GATAAAAAAC AACTGCTGAC GCCGCTGCGC GATCAGTTCA CCCGTGCACC 4250 

GCTGGATAAC GACATTGGCG TAAGTGAAGC GACCCGCATT GACCCTAACG 4300 

CCTGGGTCGA ACGCTGGAAG GCGGCGGGCC ATTACCAGGC CGAAGCAGCG 4350 

TTGTTGCAGT GCACGGCAGA TACACTTGCT GATGCGGTGC TGATTACGAC 4400 

CGCTCACGCG TGGCAGCATC AGGGGAAAAC CTTATTTATC AGCCGGAAAA 4450 

CCTACCGGAT TGATGGTAGT GGTGAAATGG CGATTACCGT TGATGTTGAA 4500 

GTGGCGAGCG ATACACCGCA TCCGGCGCGG ATTGGCCTGA ACTGCCAGCT 4550 

GGCGCAGGTA GCAGAGCGGG TAAACTGGCT CGGATTAGGG CCGCAAGAAA 4600 

ACTATCCCGA CCGCCTTACT GCCGCCTGTT TTGACCGCTG GGATCTGCCA 4650 

TTGTCAGACA TGTATACCCC GTACGTCTTC CCGAGCGAAA ACGGTCTGCG 4700 

CTGCGGGACG CGCGAATTGA ATTATGGCCC ACACCAGTGG CGCGGCGACT 4750 

TCCAGTTCAA CATCAGCCGC TACAGTCAAC AGCAACTGAT GGAAACCAGC 4800 

CATCGCCATC TGCTGCACGC GGAAGAAGGC ACATGGCTGA ATATCGACGG 4850 

TTTCCATATG GGGATTGGTG GCGACGACTC CTGGAGCCCG TCAGTATCGG 4900 

CGGAATTACA GCTGAGCGCC GGTCGCTACC ATTACCAGTT GGTCTGGTGT 4950 

CAAAAATAAT AATAACCGGG CAGGCCATGT CTGCCCGTAT TTCGCGTAAG 5000 

GAAATCCATT ATGTACTATT TAAAAAACAC AAACTTTTGG ATGTTCGGTT 5050 

TATTCTTTTT CTTTTACTTT TTTATCATGG GAGCCTACTT CCCGTTTTTC 5100 

CCGATTTGGC TACATGACAT CAACCATATC AGCAAAAGTG ATACGGGTAT 5150 

TATTTTTGCC GCTATTTCTC TGTTCTCGCT ATTATTCCAA CCGCTGTTTG 5200 

GTCTGCTTTC TGACAAACTC GGCCTCGACT CTAGGCGGCC GCGGGGATCC 5250 

FIGURE 5E

AGACATGATA AGATACATTG ATGAGTTTGG ACAAACCACA ACTAGAATGC 5300 

AGTGAAAAAA ATGCTTTATT TGTGAAATTT GTGATGCTAT TGCTTTATTT 5350 

GTAACCATTA TAAGCTGCAA TAAACAAGTT AACAACAACA ATTGCATTCA 5400 

TTTTATGTTT CAGGTTCAGG GGGAGGTGTG GGAGGTTTTT TCGGATCCTC 5450 

TAGAGTCGAC GACGCGAGGC TGGATGGCCT TCCCCATTAT GATTCTTCTC 5500 

GCTTCCGGCG GCATCGGGAT GCCCGCGTTG CAGGCCATGC TGTCCAGGCA 5550 

GGTAGATGAC GACCATCAGG GACAGCTTCA AGGATCGCTC GCGGCTCTTA 5600 

CCAGCCTAAC TTCGATCACT GGACCGCTGA TCGTCACGGC GATTTATGCC 5650 

GCCTCGGCGA GCACATGGAA CGGGTTGGCA TGGATTGTAG GCGCCGCCCT 5700 

ATACCTTGTC TGCCTCCCCG CGTTGCGTCG CGGTGCATGG AGCCGGGCCA 5750

CCTCGACCTG AATGGAAGCC GGCGGCACCT CGCTAACGGA TTCACCACTC 5800 

CAAGAATTGG AGCCAATCAA TTCTTGCGGA GAACTGTGAA TGCGCAAACC 5850 

AACCCTTGGC AGAACATATC CATCGCGTCC GCCATCTCCA GCAGCCGCAC 5900 

GCGGCGCATC TCGGGCAGCG TTGGGTCCTG GCCACGGGTG CGCATGATCG 5950 

TGCTCCTGTC GTTGAGGACC CGGCTAGGCT GGCGGGGTTG CCTTACTGCT 6000 

TAGCAGAATG AATCACCGAT ACGCGAGCGA ACGTGAAGCG ACTGCTGCTG 6050 

CAAAACGTCT GCGACCTGAG CAACAACATG AATGGTCTTC GGTTTCCGTG 6100

TTTCGTAAAG TCTGGAAACG CGGAAGTCAG CGCCCTGCAC CATTATGTTC 6150 

CGGATCTGCA TCGCAGGATG CTGCTGGCTA CCCTGTGGAA CACCTACATC 6200 

TGTATTAACG AAGCCTTTCT CAATGCTCAC GCTGTAGGTA TCTCAGTTCG 6250 

GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA 6300 

GCCCGACCGC TGCGCCTTAT CCGGTAACTA TCGTCTTGAG TCCAACCCGG 6350 

TAAGACACGA CTTATCGCCA CTGGCAGCAG CCACTGGTAA CAGGATTAGC 6400 

AGAGCGAGCT ATGTAGGCGG TGCTACAGAG TTCTTGAAGT GGTGGCCTAA 6450 

CTACGGCTAC ACTAGAAGGA CAGTATTTGG TATCTGCCCT CTGCTGAAGC 6500 

CAGTTACCTT CGGAAAAAGA GTTGGTAGCT CTTGATCCGG CAAACAAACC 6550 

FIGURE 5F

ACCGCTGGTA GCGGTGGTTT TTTTGTTTGC AAGCAGCAGA TTACGCGCAG 6600 

AAAAAAAGGA TCTCAAGAAG ATCCTTTGAT CTTTTCTACG GGGTCTGACG 6650 

CTCAGTGGAA CGAAAACTCA CGTTAAGGGA TTTTGGTCAT GAGATTATCA 6700 

AAAAGGATCT TCACCTAGAT CCTTTTAAAT TAAAAATGAA GTTTTAAATC 6750

AATCTAAAGT ATATATGAGT AAACTTGGTC TGACAGTTAC CAATGCTTAA 6800 

TCAGTGAGGC ACCTATCTCA GCGATCTGTC TATTTCGTTC ATCCATAGTT 6850 

GCCTGACTCC CCGTCGTGTA GATAACTACG ATACGGGAGG GCTTACCATC 6900 

TGGCCCCAGT GCTGCAATGA TACCGCGAGA CCCACGCTCA CCGGCTCCAG 6950 

ATTTATCAGC AATAAACCAG CCAGCCGGAA GGGCCGAGCG CAGAAGTGGT 7000 

CCTGCAACTT TATCCGCCTC CATCCAGTCT ATTAATTGTT GCCGGGAAGC 7050 

TAGAGTAAGT AGTTCGCCAG TTAATAGTTT GCGCAACGTT GTTGCCATTG 7100 

CTGCAGGCAT CGTGGTGTCA CGCTCGTCGT TTGGTATGGC TTCATTCAGC 7150 

TCCGGTTCCC AACGATCAAG GCGAGTTACA TCATCCCCCA TGTTGTGCAA 7200 

AAAAGCGGTT AGCTCCTTCG GTCCTCCGAT CGTTGTCAGA AGTAAGTTGG 7250 

CCGCAGTGTT ATCACTCATG GTTATGCCAG CACTGCATAA TTCTCTTACT 7300 

GTCATGCCAT CCGTAAGATG CTTTTCTGTG ACTGGTGAGT ACTCAACCAA 7350 

GTCATTCTGA GAATAGTGTA TGCGGCGACC GAGTTGCTCT TGCCCGGCGT 7400 

CAACACGGGA TAATACCGCG CCACATAGCA CAACTTTAAA AGTGCTCATC 7450 

ATTGGAAAAC GTTCTTCGGG GCGAAAACTC TCAAGGATCT TACCGCTGTT 7500 

GAGATCCAGT TCGATGTAAC CCACTCGTGC ACCCAACTGA TCTTCAGCAT 7550 

CTTTTACTTT CACCAGCGTT TCTGGGTGAG CAAAAACAGG AAGGCAAAAT 7600 

GCCGCAAAAA AGGGAATAAG GGCGACACGG AAATGTTGAA TACTCATACT 7650 

CTTCCTTTTT CAATATTATT GAAGCATTTA TCAGGGTTAT TGTCTCATGA 7700 

GCGGATACAT ATTTGAATGT ATTTAGAAAA ATAAACAAAT AGGGGTTCCG 7750 

CGCACATTTC CCCGAAAAGT GCCACCTGAC GTCTAAGAAA CCATTATTAT 7800

CATGACATTA ACCTATAAAA ATAGGCGTAT CACGAGGCCC TTTCGTCTTC 7850 

AA 7852 

FIGURE 7A

TCTTCCGCTT CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG 50 

GCGAGCGGTA TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT 100 

CAGGGGATAA CGCAGGAAAG AACATGTGAG CAAAAGGCCA GCAAAAGGCC 150 

AGGAACCGTA AAAAGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC 200 

CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC 250 

CGACAGGACT ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG 300 

CGCTCTCCTG TTCCGACCCT GCCGCTTACC GGATACCTGT CCGCCTTTCT 350 

CCCTTCGGGA AGCGTGGCGC TTTCTCATAG CTCACGCTGT AGGTATCTCA 400 

GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA CGAACCCCCC 450 

GTTCAGCCCG ACCGCTGCGC CTTATCCGGT AACTATCGTC TTGAGTCCAA 500 

CCCGGTAAGA CACGACTTAT CGCCACTGGC AGCAGCCACT GGTAACAGGA 550 

TTAGCAGAGC GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG 600 

CCTAACTACG GCTACACTAG AAGAACAGTA TTTGGTATCT GCGCTCTGCT 650 

GAAGCCAGTT ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC 700 

AAACCACCGC TGGTAGCGGT GGTTTTTTTG TTTGCAAGCA GCAGATTACG 750 

CGCAGAAAAA AAGGATCTCA AGAAGATCCT TTGATCTTTT CTACGGGGTC 800 

TGACGCTCAG TGGAACGAAA ACTCACGTTA AGGGATTTTG GTCATGAGAT 850 

TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT 900 

AAATCAATCT AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG 950 

CTTAATCAGT GAGGCACCTA TCTCAGCGAT CTGTCTATTT CGTTCATCCA 1000 

TAGTTGCCTG ACTCCCCGTC GTGTAGATAA CTACGATACG GGAGGGCTTA 1050 

CCATCTGGCC CCAGTGCTGC AATGATACCC CCAGACCCAC GCTCACCGGC 1100 

TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA 1150 

GTGGTCCTGC AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG 1200 

GAAGCTAGAG TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC 1250 

CATTGCTACA GGCATCGTGG TGTCACGCTC GTCGTTTGGT ATGGCTTCAT 1300 
FIGURE 7B 

TCAGCTCCGC TTCCCAACGA TCAAGGCGAG TTACATGATC CCCCATGTTG 1350 
TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT CCGATCGTTG TCAGAAGTAA 1400 
GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC 1450 
TTACTGTCAT GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA 1500 
ACCAAGTCAT TCTGAGAATA GTGTATGCGG CGACCGAGTT GCTCTTGCCC 1550 
GGCGTCAATA CGGGATAATA CCGCGCCACA TAGCAGAACT TTAAAAGTGC 1600 
TCATCATTGG AAAACGTTCT TCGGGGCGAA AACTCTCAAG GATCTTACCG 1650 

CTGTTGAGAT CCAGTTCGAT GTAACCCACT CGTGCACCCA ACTGATCTTC 1700 

AGCATCTTTT ACTTTCACCA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC 1750 

AAAATGCCGC AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC 1800 

ATACTCTTCC TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT 1850 

CATGAGCGGA TACATATTTG AATGTATTTA GAAAAATAAA CAAATAGGGG 1900 

TTCCGCGCAC ATTTCCCCGA AAAGTGCCAC CTGACGTCTA AGAAACCATT 1950 

ATTATCATGA CATTAACCTA TAAAAATAGG CGTATCACGA GGCCCTTTCG 2000 

TCTCGCGCGT TTCGGTGATG ACGGTGAAAA CCTCTGACAC ATGCAGCTCC 2050 

CGGAGACGGT CACAGCTTGT CTGTAAGCGG ATGCCGGGAG CAGACAAGCC 2100 

CGTCAGGGCG CGTCAGCGGG TGTTGGCGGG TGTCGGGGCT GGCTTAACTA 2150 

TGCGGCATCA GAGCAGATTG TACTGAGAGT GCACCATAAA ATTGTAAACG 2200 

TTAATATTTT GTTAAAATTC GCGTTAAATT TTTGTTAAAT CAGCTCATTT 2250 

TTTAACCAAT AGGCCGAAAT CGGCAAAATC CCTTATAAAT CAAAAGAATA 2300 

GCCCGAGATA GGGTTGAGTG TTGTTCCAGT TTGGAACAAG AGTCCACTAT 2350 

TAAAGAACGT GGACTCCAAC GTCAAAGGGC GAAAAACCGT CTATCAGGGC 2400 

GATGGCCCAC TACGTGAACC ATCACCCAAA TCAAGTTTTT TGGGGTCGAG 2450 

GTGCCGTAAA GCACTAAATC GGAACCCTAA AGGGAGCCCC CGATTTAGAG 2500 

CTTGACGGGG AAAGCCGGCG AACGTGGCGA GAAAGGAAGG GAAGAAAGCG 2550 

AAAGGAGCGG GCGCTAGGGC GCTGGCAAGT GTAGCGGTCA CGCTGCGCGT 2600 
FIGURE 7C 


AACCACCACA CCCGCCGCGC TTAATGCGCC GCTACAGGCC GCGTACTATG 2650 
GTTGCTTTGA CGTATGCGGT GTGAAATACC GCACAGATGC GTAAGGAGAA 2700 
AATACCGCAT CAGGCGCCAT TCGCCATTCA GGCTGCGCAA CTGTTGGGAA 2750 
GGGCGATCGG TGCGGGCCTC TTCGCTATTA CGCCAGCTGG CGAAAGGGGG 2800 
ATGTGCTGCA AGGCGATTAA GTTGGGTAAC GCCAGGGTTT TCCCAGTCAC 2850 
GACGTTGTAA AACGACGGCC AGTGCCAAGC TTAAGGTGCA CGGCCCACGT 2900 
GGCCACTAGT ACTTCTCGAG CTCTGTACAT GTCCGCGGTC GCGACGTACG 2950 
CGTATCGATG GCGCCAGCTG CAGGCGGCCG CCATATGCAT CCTAGGCCTA 3000 
TTAATATTCC GGAGTATACG TAGCCGGCTA ACGTTAACAA CCGGTACCTC 3050 
TAGAACTATA GCTAGCCAAT TCCATCATCA ATAATATACC TTATTTTGGA 3100 
TTGAAGCCAA TATGATAATG AGGGGGTGGA GTTTGTGACG TGGCGCGGGG 3150 
CGTGGGAACG GGGCGGGTGA CGTAGGTTTT AGGGCGGAGT AACTTGTATG 3200 
TGTTGGGAAT TGTAGTTTTC TTAAAATGGG AAGTTACGTA ACGTGGGAAA 3250 
ACGGAAGTGA CGATTTGAGG AAGTTGTGGG TTTTTTGGCT TTCGTTTCTC 3300 
GGCGTAGGTT CGCGTGCGGT TTTCTGGGTG TTTTTTGTGG ACTTTAACCG 3350 
TTACGTCATT TTTTAGTCCT ATATATACTC GCTCTGCACT TGGCCCTTTT 3400 
TTACACTGTG ACTGATTGAG CTGGTGCCGT GTCGAGTGGT GTTTTTTTAA 3450 
TAGGTTTTCT TTTTTACTGG TAAGGCTGAC TGTTAGGCTG CCGCTGTGAA 3500 
GCGCTGTATG TTGTTCTGGA GCGGGAGGGT GCTATTTTGC CTAGGCAGGA 3550 
GGGTTTTTCA GGTGTTTATG TGTTTTTCTC TCCTATTAAT TTTGTTATAC 3600 
CTCCTATGGG GGCTGTAATG TTGTCTCTAC GCCTGCGGGT ATGTATTCCC 3650 
CCCAAGCTTG CATGCCTGCA GGTCGACTCT AGAGGATCCG AAAAAACCTC 3700 
CCACACCTCC CCCTGAACCT GAAACATAAA ATGAATGCAA TTGTTGTTGT 3750 
TAACTTGTTT ATTGCAGCTT ATAATGGTTA CAAATAAAGC AATAGCATCA 3800 
CAAATTTCAC AAATAAAGCA TTTTTTTCAC TGCATTCTAG TTGTGGTTTG 3850 
TCCAAACTCA TCAATGTATC TTATCATGTC TGGATCCCCC TAGCTTGCCA 3900 
FIGURE 7D 


AACCTACAGG TGGGGTCTTT CATTCCCCCC TTTTTCTGGA GACTAAATAA 3950 
AATCTTTTAT TTTATCTATG GCTCGTACTC TATAGGCTTC AGCTGGTGAT 4000 
ATTGTTGAGT CAAAACTAGA GCCTGGACCA CTGATATCCT GTCTTTAACA 4050 
AATTGGACTA ATCGCGGGAT CAGCCAATTC CATGAGCAAA TGTCCCATGT 4100 
CAACATTTAT GCTGCTCTCT AAAGCCTTGT ATCTTGCATC TCTTCTTCTG 4150 
TCTCCTCTTT CAGAGCAGCA ATCTGGGGCT TAGACTTGCA CTTGCTTGAG 4200 
TTCCGGTGGG GAAAGAGCTT CACCCTGTCG GAGGGGCTGA TGGCTTGCCG 4250 
GAAGAGGCTC CTCTCGTTCA GCAGTTTCTG GATGGAATCG TACTGCCGCA 4300 
CTTTGTTCTC TTCTATGACC AAAAATTGTT GGCATTCCAG CATTGCTTCT 4350 
ATCCTGTGTT [illegible] [illegible] [illegible] [illegible] 4400 
AGTTCTTCTA ATTATTTGGT ATGTTACTGG ATCCAAATGA GCACTGGGTT 4450 
CATCAAGCAG CAAGATCTTC GCCTTACTGA GAAGAGATCT AGCCAAGCAC 4500 
ATCAACTGCT TGTGGCCATG GCTTAGGACA CAGCCCCCAT CCACAAGGAC 4550 
AAAGTCAAGC TTCCCAGGAA ACTGTTCTAT CACAGATCTG AGCCCAACCT 4600 
CATCTGCAAC TTTCCATATT TCTTGATCAC TCCACTGTTC ATAGGGATCC 4650 
AAGTTTTTTC TAAATGTTCC AGAAAAAATA AATACTTTCT GTGGTATCAC 4700 
TCCAAAGGCT TTCCTCCACT GTTGCAAAGT TATTGAATCC CAAGACACAC 4750 
CATCGATCTG GATTTCTCCT TCAGTGTTCA GTAGTCTCAA AAAAGCTGAT 4800 
AACAAAGTAC TCTTCCCTGA TCCAGTTCTT CCCAAGAGGC CCACCCTCTG 4850 
GCCAGGACTT ATTGAGAAGG AAATGTTCTC TAATATGGCA TTTCCACCTT 4900 
CTGTGTATTT TGCTGTGAGA TCTTTGACAG TCATTTGGCC CCCTGAGGGC 4950 
CAGATGTCAT CTTTCTTCAC GTGTGAATTC TCAATAATCA TAACTTTCGA 5000 
GAGTTGGCCA TTCTTGTATG GTTTGGTTGA CTTGGTAGGT TTACCTTCTG 5050 
TTGGCATGTC AATGAACTTA AAGACTCGGC TGACAGATCG CATCAAGCTA 5100 
TCCACATCTA TGCTGGAGTT TACAGCCCAC TGCAATGTAC TCATGATATT 5150 
CATGGCTAAA GTCAGGATAA TACCAACTCT TCCTTCTCCT TCTCCTGTTG 5200 
FIGURE 7E 


TTAAAATGGA AATGAAGGTA ACAGCAATGA AGAAGATGAC AAAAATCATT 5250 

TCTATTCTCA TTTGGAACCA GCGCAGTGTT GACAGGTACA AGAACCAGTT 5300 

GGCAGTATGT AAATTCAGAG CTTTGTGGAA CAGAGTTTCA AAGTAAGGCT 5350 

GCCGTCCGAA GGCACGAAGT GTCCATAGTC CTTTTAAGCT TGTAACAAGA 5400 

TGAGTGAAAA TTGGACTCCT GCCTTCAGAT TCCAGTTGTT TGAGTTGCTG 5450 

TGAGGTTTGG AGGAAATATG CTCTCAACAT AATAAAAGCC ACTATCACTG 5500 

GCACTGTTGC AACAAAGATG TAGGGTTGTA AAACTGCGAC AACTGCTATA 5550 

GCTCCAATCA CAATTAATAA CAACTGGATG AAGTCAAATA TGGTAAGAGG 5600 

CAGAAGGTCA TCCAAAATTG CTATATCTTT GGAGAATCTA TTAAGAATCC 5650 

CACCTGCTTT CAACGTGTTG AGGGTTGACA TAGGTGCTTG AAGAACAGAA 5700 

TGTAACATTT TGTGGTGTAA AATTTTCGAC ACTGTGATTA GAGTATGCAC 5750 

CAGTGGTAGA CCTCTGAAGA ATCCCATAGC AAGCAAAGTG TCGGCTACTC 5800 

CCACGTAAAT GTAAAACACA TAATACGAAC TGGTGCTGGT GATAATCACT 5850 

GCATAGCTGT TATTTCTACT ATGAGTACTA TTCCCTTTGT CTTGAAGAGG 5900 

AGTGTTTCCA AGGAGCCACA GCACAACCAA AGAAGCAGCC ACCTCTGCCA 5950 

GAAAAATTAC TAAGCACCAA ATTAGCACAA AAATTAAGCT CTTGTGGACA 6000 

GTAATATATC GAAGGTATGT GTTCCATGTA GTCACTGCTC GTATGCTCTC 6050 

CATATCATCA AAAAAGCACT CCTTTAAGTC TTCTTCGTTA ATTTCTTCAC 6100 

TTATTTCCAA GCCAGTTTCT TGAGATAACC TTCTTGAATA TATATCCAGT 6150 

TCAGTCAAGT TTGCCTGAGG GGCCAGTGAC ACTTTTCGTG TGGATGCTGT 6200 

TGTCTTTCGG TGAATGTTCT GACCTTGGTT AXCTGAGTGT GTCATCAGGT 6250 

TCAGGACAGA CTGCCTCCTT CGTGCCTGAA GCGTGGGGCC AGTGCTGATC 6300 

ACGCTGATGC GAGGCAGTAT CGCCTCTCCC TGCTCAGAAT CTGGTACTAA 6350 

GGACAGCCTT CTCTCTAAAG GCTCATCAGA ATCCTCTTCG ATGCCATTCA 6400 

TTTGTAAGGG AGTCTTTTGC ACAATGGAAA ATTTTCGTAT AGAGTTGATT 6450 

GGATTGAGAA TAGAATTCTT CCTTTTTTCC CCAAACTCTC CAGTCTGTTT 6500 
FIGURE 7F 

AAAAGATTGT TTTTTTGTTT CTGTCCAGGA GACAGGAGCA TCTCCTTCTA 6550 
ATGAGAAACG GTGTAAGGTC TCAGTTAGGA TTGAATTTCT TCTTTCTGCA 6600 
CTAAATTGGT CGAAAGAATC ACATCCCATG AGTTTTGAGC TAAAGTCTGG 6650 
CTGTAGATTT TGGAGTTCTG AAAATGTCCC ATAAAAATAG CTGCTACCTT 6700 
CATGCAAAAT TAATATTTTG TCAGCTTTCT TTAAATGTTC CATTTTAGAA 6750 
GTGACCAAAA TCCTAGTTTT GTTAGCCATC AGTTTACAGA CACAGCTTTC 6800 
AAATATTTCT TTTTCTGTTA AAACATCTAG GTATCCAAAA GGAGAGTCTA 6850 
ATAAATACAA ATCAGCATCT TTGTATACTG CTCTTGCTAA AGAAATTCTT 6900 
GCTCGTTGAC CTCGACTCAG TGTGATTCCA CCTTCTCCAA GAACTATATT 6950 
GTCTTTCTCT GCAAACTTGG AGATGTCCTC TTCTAGTTGG CATGCTTTGA 7000 
TGACGCTTCT GTATCTATAT TCATCATAGG AAACACCAAA GATGATATTT 7050 
TCTTTAATGG TGCCAGGCAT AATCCAGGAA AACTGAGAAC AGAATGAAAT 7100 
TCTTCCACTG TGCTTAATTT TACCCTCTGA AGGCTCCAGT TCTCCCATAA 7150 

TCATCATTAG AAGTGAAGTC TTGCCTGCTC CAGTGGATCC AGCAACCGCC 7200 

AACAACTGTC CTCTTTCTAT CTTGAAATTA ATATCTTTCA GGACAGGAGT 7250 

ACCAAGAAGT GAGAAATTAC TGAAGAAGAG GCTGTCATCA CCATTAGAAG 7300 

TTTTTCTATT GTTATTGTTT TGTTTTGCTT TCTCAAATAA TTCCCCAAAT 7350 

CCCTCCTCCC AGAAGGCTGT TACATTCTCC ATCACTACTT CTGTAGTCGT 7400 

TAAGTTATAT TCCAATGTCT TATATTCTTG CTTTTGTAAG AAATCCTGTA 7450 

TTTTGTTTAT TGCTCCAAGA GAGTCATACC ATGTTTGTAC AGCCCAGGGA 7500 

AATTGCCGAG TGACCGCCAT GCGCAGAACA ATGCAGAATG AGATGGTGGT 7550 

GAATATTTTC CGGAGGATGA TTCCTTTGAT TAGTGCATAG GGAAGCACAG 7600 

ATAAAAACAC CACAAAGAAC CCTGAGAAGA AGAAGGCTGA GCTATTGAAG 7650 

TATCTCACAT AGGCTGCCTT CCGAGTCAGT TTCAGTTCTG TTTGTCTTAA 7700 

GTTTTCAATC ATTTTTTCCA TTGCTTCTTC CCAGCAGTAT GCCTTAACAG 7750 

ATTGGATGTT CTCGATCATT TCTGAGGTAA TCACAAGTCT TTCACTGATC 7800 
FIGURE 7G 

TTCCCAGCTC TCTGATCTCT GTACTTCATC ATCATTCTCC CTAGCCCAGC 7850 

CTGAAAAAGG GCAAGGACTA TCAGGAAACC AAGTCCACAG AAGGCAGACG 7900 

CCTGTAACAA CTCCCAGATT AGCCCCATGA GGAGTGCCAC TTGCAAAGGA 7950 

GCGATCCACA CGAAATGTGC CAATGCAAGT CCTTCATCAA ATTTGTTCAG 8000 

GTTGTTGGAA AGGAGACTAA CAAGTTGTCC AATACTTATT TTATCTAGAA 8050 

CACGGCTTGA CAGCTTTAAA GTCTTCTTAT AAATCAAACT AAACATAGCT 8100 

ATTCTCATCT GCATTCCAAT GTGATGAAGG CCAAAAATGG CTGGGTGTAG 8150 

GAGCAGTGTC CTCACAATAA AGAGAAGGCA TAAGCCTATG CCTAGATAAA 8200 

TCGCGATAGA GCGTTCCTCC TTGTTATCCG GGTCATAGGA AGCTATGATT 8250 

CTTCCCAGTA AGAGAGGCTG TACTGCTTTG GTGACTTCCC CTAAATATAA 8300 

AAAGATTCCA TAGAACATAA ATCTCCAGAA AAAACATCGC CGAAGGGCAT 8350 

TAATGAGTTT AGGATTTTTC TTTGAAGCCA GCTCTCTATC CCATTCTCTT 8400 

TCCAATTTTT CAGATAGATT GTCAGCAGAA TCAACAGAAG GGATTTGGTA 8450 

TATGTCTGAC AATTCCAGGC GCTGTCTGTA TCCTTTCCTC AAAATTGGTC 8500 

TGGTCCAGCT GAAAAAAAGT TTGGAGACAA CGCTGGCCTT TTCCAGAGGC 8550 

GACCTCTGCA TGGTCTCTCG GGCGCTGGCG TCCCTGCTAG GGCCGTCTGG 8600 

GCTCAAGCTC CTAATGCCAA AGGAATTCCT GCAGCCCGGG GGATCCACTA 8650 

GTTCTAGAGC GGCCGCCACC GCGGTGGCTG ATCCCGCTCC CGCCCGCCGC 8700 

GCGCTTCGCT TTTTATAGGG CCGCCGCCGC CGCCGCCTCG CCATAAAAGG 8750 

AAACTTTCGG AGCGCGCCGC TCTGATTGGC TGCCGCCGCA CCTCTCCGCC 8800 

TCGCCCCGCC CCGCCCCTCG CCCCGCCCCG CCCCGCCTGG CGCGCGCCCC 8850 

CCCCCCCCCC CCGCCCCCAT CGCTGCACAA AATAATTAAA AAATAAATAA 8900 

ATACAAAATT GGGGGTGGGG AGGGGGGGGA GATGGGGAGA GTGAAGCAGA 8950 

ACGTGGCCTC GAGTAGATGT ACTGCCAAGT AGGAAAGTCC CATAAGGTCA 9000 

TGTACTGGGC ATAATGCCAG GCGGGCCATT TACCGTCATT GACGTCAATA 9050 

GGGGGCGTAC TTGGCATATG ATACACTTGA TGTACTGCCA AGTGGGCAGT 9100 
FIGURE 7H 

TTACCGTAAA TACTCCACCC ATTGACGTCA ATGGAAAGTC CCTATTGGCG 9150 

TTACTATGGG AACATACGTC ATTATTGACG TCAATGGGCG GGGGTCGTTG 9200 

GGCGGTCAGC CAGGCGGGCC ATTTACCGTA AGTTATGTAA CGACCTGCAG 9250 

GCTGATCTCC CTAGACAAAT ATTACGCGCT ATGAGTAACA CAAAATTATT 9300 

CAGATTTCAC TTCCTCTTAT TCAGTTTTCC CGCGAAAATG GCCAAATCTT 9350 

ACTCGGTTAC GCCCAAATTT ACTACAACAT CCGCCTAAAA CCGCGCGAAA 9400 

ATTGTCACTT CCTGTGTACA CCGGCGCACA CCAAAAACGT CACTTTTGCC 9450 

ACATCCGTCG CTTACATGTG TTCCGCCACA CTTGCAACAT CACACTTCCG 9500 

CCACACTACT ACGTCACCCG CCCCGTTCCC ACGCCCCGCG CCACGTCACA 9550 

AACTCCACCC CCTCATTATC ATATTGGCTT CAATCCAAAA TAAGGTATAT 9600 

TATTGATGAT GCTAGCATGC GCAAATTTAA AGCGCTGATA TCGATCGCGC 9650 
GCAGATCTGT CATGATGATC ATTGCAATTG GATCCATATA TAGGGCCCGG 9700 
GTTATAATTA CCTCAGGTCG ACGTCCCATG GCCATTCGAA TTCGTAATCA 9750 

TGGTCATAGC TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACA 9800 

CAACATACGA GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAATGAG 9850 

TGAGCTAACT CACATTAATT GCGTTGCGCT CACTGCCCGC TTTCCAGTCG 9900 

GGAAACCTGT CGTGCCAGCT GCATTAATGA ATCGGCCAAC GCGCGGGGAG 9950 

AGGCGGTTTG CGTATTGGGC GC 9972 
FIGURE 12A 


CCAATTCCAT CATCAATAAT ATACCTTATT TTGGATTGAA GCCAATATGA 50 
TAATGAGGGG GTGGAGTTTG TGACGTGGCG CGGGGCGTGG GAACGGGGCG 100 
GGTGACGTAG GTTTTAGGGC GGAGTAACTT GTATGTGTTG GGAATTGTAG 150 
TTTTCTTAAA ATGGGAAGTT ACGTAACGTG GGAAAACGGA AGTGACGATT 200 
TGAGGAAGTT GTGGGTTTTT TGGCTTTCGT TTCTGGGCGT AGGTTCGCGT 250 
GCGGTTTTCT GGGTGTTTTT TGTGGACTTT AACCGTTACG TCATTTTTTA 300 
GTCCTATATA TACTCGCTCT GCACTTGGCC CTTTTTTACA CTGTGACTGA 350 
TTGAGCTGGT GCCGTGTCGA GTGGTGTTTT TTTAATAGGT TTTCTTTTTT 400 
ACTGGTAAGG CTGACTGTTA GGCTGCCGCT GTGAAGCGCT GTATGTTGTT 450 
CTGGAGCGGG AGGGTGCTAT TTTGCCTAGG CAGGAGGGTT TTTCAGGTGT 500 
TTATGTGTTT TTCTCTCCTA TTAATTTTGT TATACCTCCT ATGGGGGCTG 550 
TAATGTTGTC TCTACGCCTG CGGGTATGTA TTCCCCCCAA GCTTGCATGC 600 
CTGCAGGTCG ACTCTAGAGG ATCCGAAAAA ACCTCCCACA CCTCCCCCTG 650 
AACCTGAAAC ATAAAATGAA TGGAATTGTT GTTGTTAACT TGTTTATTGC 700 
AGCTTATAAT GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA 750 
AAGCATTTTT TTCACTGCAT TCTAGTTGTG GTTTGTCCAA ACTCATCAAT 800 
GTATCTTATC ATGTCTGGAT CCCCGCGGCC GCTCTAGAAC TAGTGGATCC 850 
CCCGGGCTGC AGGAATTCCG TAACATAACT GCGTGCTTTA TTGAGATACA 900 
CAGTAAAGCA GTAATATAAT ACAATAGTAA GGCATATATT TGGTGAAATC 950 
TGATATGTTG TGAAAATGCA GTAAAACTGA AGTTTAAAAA AATAATTAGT 1000 
AAATGTTACA GTGTTGGTCT TAAAACACAA TCTATTATGA TACTCAAGTA 1050 
AGAGTCCAGT ACCTGGAGAC AATGATGATA CATGCCATGT GATGATTATG 1100 
CTTCAGTTAC ACTGATTATG ATTTACACTT TAATACTTGA TGGTTATAAA 1150 
GAACATGAAA TGATGTCCAA ATTATGCTTA AAATCAGCAA TAAAGCTCTC 1200 
AGTTTTTATT CAAATATTTT GATAGATTCA CTCCAGAACT AATATCTAAA 1250 
FIGURE 12B 

AGATAAAACG AAAAGATTAA AACAAAACTA TGCACTCTAT CTACCTTGGA 1300 
TTTTAGAATG AAACTTAAAA CTTCTTAGTA GGAAAGGAAC CCCTTGTTTT 1350 
AAATCTTGGT GAAAACAAAT CCTTGGATAA AGAAAATGCC CAGTGCCACA 1400 
TAAAGGAGAG AGAGAGAGAA AAGCAAGACC AGAACCAAAT TTCAATTTGT 1450 
TATCTTAGAG CTTTGGGTTT TCTTTTGGAA ATTATAAATG AAAAAAGGAA 1500 
ACTGGTGTCC ACACAACAGA CAAGTGGTGA AGTTGTGAAA TTAGGTGTGC 1550 
ACAATTACTA GAAACACCCC AAAACCAAAG TGAGGTAGAA ATAGCATGAG 1600 
AAGCTGTGTT TGATGTTAAT TACAATTAAT AATGGACAAA ACCCACTCGC 1650 
TAGAAGTTAA TTACACTTGA CGTTAGAGGT AACAGATTTG CAAAATGATA 1700 
GGACAGTGAT TTCTATTGAG AGAATGCTCT TTAAATGCTA AGAAGAAGAA 1750 
ACTGGCATGA GAGGAGTAAA GCTCTTCCTA GCAGTCCTTA GCTTTCTGTT 1800 
GCACTTTTTC TCCTGGTTCA ATGACTTGCA TTTGTTTAGA CATTTCAGCC 1850 
CGTCAACTAG ACCAGAGAGT TTGGAGACGC TTTTGCTCTC AAAACTTTCC 1900 
AACCACTGTG CCTTCTCACC CACAATCCTG TGTGGAGTTA CTTGCAGGGA 1950 
AACCAATGCA AAGGAGACAA ATGCAGTTCA TGGGCTTCTG GACTGATATT 2000 
CACCAGGGTC ACAATGTGAT TGGGTTACTT TCTTAACAGT AATCCTAAGT 2050 
CTTGCAGCAT TAAAAAAAAA AATCATCACA ATGAAGAAAA AAAAACCCAA 2100 
AAAATCTAAA ATCTAAAATT CATCATCATC ATCAACAACA ACAACAACAA 2150 
CAACAACAAA ACCACCCACT TCAGGTTGAG TTTATGAAGA GGGCAGAACA 2200 
ATTTAGTTGT AATTATAGAG ATGTTTATAT GTATAGTTGT AAATATTCAT 2250 
CCATTCTTTT ACAGAGTTGT TGCTCCCCTC ATATAAATTG ACTGAGGAGC 2300 
CGCAACCTTT AGCTCCTACC ATCTTCCTCC TACTGTCTGG GAGTTAAAAA 2350 
TGTCATCTGA TGTTCTATTG CAGAAACATC ATTAAATATA ACCCAACAGT 2400 
AGGAAGTTGA ATATATCAGC CAACAAATTA CTATGATAGT AAGTCCTGTG 2450 
TATTCATTCG CATGTTCCTT GAAAAAAATG AATCCTCTAG CTCTCAGTGG 2500 
FIGURE 12C 

AAAGTTTAAA ACTAGAAACA TCTGGAGCCC TAGACAATAT TTTAGTGTGG 2550 

CGGTAGTCTC CTGGCTTTGG GCTCCAGGGA AAATTCACTC TTGCCCAAGC 2600 

AGATAAGCCC AGATGACTAG AAGCAATTTC CATAGGAAG TGGCAAGAAC 2650 

ATTTGAAGAA GTAACTTCAT ATCTATTTAT CTATATACCT ATAGTATTTA 2700 

TATACTTGTA GACATATAGA TGTATAAAAT GAAAGCCCAT AGCCAGCCCC 2750 

ACTCAGTCAA CAATTCTCAA AAGAGCAATA TGAAGCAGTC ATTTGGTGGG 2800 

GTTCGTATGC AAGAAAATAA AAAAACGTCA TGAATTCCAT ATGAATACCA 2850 

CGCTAAAGTA ATGCAAAACA ATGTGCTGCC TCAGTGTGTG TGTGTGTGTG 2900 

TGTGTGTGTG GTGGGTTCGT GCATGTATGT GTGCGTGTGT GTGTGTGTGT 2950 

GTGTGTGTGT GTGTGTGTGC GTGTGTGTTT GTTTAGGGGT TTTTATAAAC 3000 

AACTTTTTTT ATAAAGCACA CTTTAGTTTA CAATCTCTCT TTATAACTGT 3050 

TATAAATTTT TAAACAACCC AAAATGCGTT CCATATAAAG AAATGGCAAG 3100 

TTATTTAGCT ATCAAGATTT TACATGTTTT CTTTTAACTT TTTTGTACAA 3150 

TTGCATAGAC GTGTAAAACC TGCCATTGTT AACAAAACAA TAACAGACTT 3200 

AGAAACTACT GAAATCTACA GTATAGTACC ACTACCCTTC ACAAAAATAT 3250 

AGATTTTATT TCTTGTAAAC TCTTACTGTC TAATCCTCTT TGTTGTACGA 3300 

ATATTATAAA AACCATGCGG GAATCAGGAG TTGTAAAACA TTTATTCTGC 3350 

TCCTTCTTCA TCTGTCATGA CTGAAACTAA GGACTCCATC GCTCTGCCCA 3400 

AATCATCTGC CATGTGGAAA AGGCTTCCTA CATTGTGTCC TCTCTCATTG 3450 

GCTTTCCGGG GGCATTTCTT CCTCTTGAAC TAGGGAAGGA GTTGTTGAGT 3500 

TGCTCCATCA CTTCTTCTAA CCCTGTGCTT GTGTCCTGGG GAGGACTCAG 3550 

AAGATCTTCC TCACCCATAG ATTCTGAAGT TTGACTGCCA ACCACTCGGA 3600 

GCAGCATAGG CTGACTGCTA TCTGACCTCT GCAGAGAGGT GGAAGGAGAG 3650 

GACACCGTGG TGCCATTCAC CTTAGCTTCA GCCTGGGGCT GCTCCAGGAG 3700 

CTGTCTCAGT CTATGTAACT GAGACTCCAG CTGTTTATTG TGGTCTTCCA 3750 
FIGURE 12D 

GGATTTGCAT CCTGGCTTCC AGGCGTCCTT TGTGTTGGCG CAGTAGCTTA 3800 
GCCTCAGCAA TGAGCTCAGC ATCCCTGGGA CTCTGAGGAG AGGTGGGCAT 3850 
CATCTCAGGA GGAGATGGCA GTGGAGACAG GCCTTTATGC TCATGCTGCT 3900 
GCTTCAGGCG ATCATATTCT GCTTGCAGAT TCCTGTTTTC TTCCTCAAGA 3950 
TCTG CTAGGA TTCTCTCTAG CTCCCCTCTT TCCTCACTCT CTAAGGAAAT 4000 
CAAGATCTGG GCAGGACTAC GAGGCTGGCT CAGGGGGGAG TCCTGGTTCA 4050 
AACTTTGGCA GTAATGCTGG ATTAACAAAT GTTCATCATC TATGCTCTCA 4100 
TTAGGAGAGA TGCTATCATT TAGATAAGAT CCATTGCTGT TTTCCATTTC 4150 
[illegible] CTAGCATAAT [illegible] TGAATGAGTA TCATCGTGTG 4200 
& A ACCTGfiGG GGACGAGGCA GGCGCAGAAT CTACTGGCCA GAAGTTGATC 4250 
AwUi lAAWwv GAGTTTCCAT GTTGTCCCCC TCTAACACAG TCTGCACTGG 4300 
[illegible] AX XVvUWVj^sAX GCTTCGCAAA A7ACCTTT7G GTTCGAAATT 4350 
TGTTTTTTAG TACCTTGGCG AAUlbUVAUIA [illegible] GGATGTAGTC 4400 
wnuluvAAl AU1V.1AUJU [illegible] [illegible] CCTTTGCAAC 4450 
[illegible] AAAAAGCAAC TTTGGCAGAT GTCATAATTA AAATGCTTTA 4500 
GGCTTCTGTA [illegible] ATGATTGGAC ACTCCTTACA GATGTTACAC 4550 
TTGGCTTGAT GCTTGGCAGT TTCAGCAGCA [illegible] GCAAGACGGG 4600 
[illegible] ATAGACFGfiC [illegible] CATCCAGTCA AGGAAGAGAG 4650 
www X X wwi CTCAGGTTTA TTATTGGCAA ATTGGAAGCA GCTCCTGACA 4700 
CTCGGCTCAA TGTTACTGCC CCCAAAGGAA bUUtUx IVAb X W X 4750 
TGGGATTTGA ATAGAATCAT GCAGAAGAAG ACCCAGCCTA CGCTGGTCAC 4800 
AAAAGCCAGT TGAACTTGCC ACTTGCTTGA AAAGGTATCT GTACTTGTCT 4850 
TCCAAGTGTG CTTTACACAG AGAAATGATG CCAGTTTTAA AAGACAGGAC 4900 
ACGGATCCTC CCTGTTCGTC CCGTATCATA AACATTGAGA AGCCAGTTGA 4950 
GACACATATC CACACAGAGA GGGACATTGA CCAGATTGTT GTGCTCTTGC 5000 
TCCAGACGAT CATAAATTGT AGTCAAACAG TTAATTATCT GCAGGATATC 5050 
FIGURE 12E 

CATGGGCTGG TCATTTTGCT TGAGGTTGTG CTGGTCCAGG GCATCACATG 5100 

CAGCTGACAG GCTCAAGAGA TCCAAGCAAA GGGCCTTCTG GAGCCTTCTG 5150 

AGCTTCATGG CAGTCCTATA CGCGGAGAAC CTGACATTAT TCAGGTCAGC 5200 

TAAAGACTGG TAGAGCTCTG TCATTTTGGG GTCGTCCCAA CAAGTGGTTT 5250 

GGGTCTCGTG GTTGATATAG TAGGGCACTT TGTTTGGTGA GATGGCTCTC 5300 

TCCCAGGGAC CCTGAACTGA AGTGGAAAGG AAGTGCTGGG ATGCAGGACC 5350 

AAAGTCCCTG TGGGCTTCAT GCAGCTGTCT GACACGGTCC TCCACAGCCA 5400 

CCTGTAGAAG CCTCCATCTG GTATTCAGAT CTTCCAAAGT GCTGAGGTTA 5450 

TAAGGTGAGA GCTGAATGCC CAGTGTGGTC AGCTGATGTG CAAGGTCATT 5500 

GACACGATTG ACATTCTCTT TAAGAGGTGC AATTTCTCCC CGAAGTGCCT 5550 

TGACTTTTTC AAGGTGATCT TGCAGAGAGT CAATGAGGAG ATCCCCCACT 5600 

GGCTGCCAGG ATCCCTTGAT CACCTCAGCT TGGCGCAACT TGAGGTCCAG 5650 

TTCATCGGCA GCTTCCTGAA GTTCCTGGAG TCTTTCAAGA GCTTCATCTA 5700 

TTTTTCTCTG CCAATCAGCT GAGCGCAGGT TCAATTTGTC CCATTCAGCG 5750 

TTGACCTCTT CAGCCTGCTT TCGTAGGAGC CGAGTGACAT TCTGAGCTCT 5800 

TTCTTCAGGA GGCAGTTCTC TGGGCTCCTG GTAGAGTTTC TCTAGTCCTT 5850 

CCAAAGGCTG CTCTGTCAGA AATATTCTCA CAGTCTCCAG AGTACTCATG 5900 

ATTACAGGTT CTTTAGTTTT CAATTCCCTC TTGAAGGCCC TATGTATATC 5950 

ATTCTGCTTC TGAACTGCTG GCAAATCACC ACCGATGGGT GCCTGACGGC 6000 

TCAGTTCATC ATCTTTCAGC TGTACCCAAA CAAGAAGTTC CTGAAGAGAA 6050 

AGATGCAAAC GCTTCCACTG GTCAGAACTT GCTTCCAAAT GGGACCTAAT 6100 

GTTGAGAGAC TTTTTCTGAA GTTCACTCCA CTTGAAATTC ATGTTATCCA 6150 

AACGTCTTTG TAACAGGGGT GCTTCATCCG AACCTTCCAG GGATCTCAGG 6200 

ATTTTTTGGC CATTTTCATC AAGATTGTGA TAGATATCTG TGTGAGTTTC 6250 

AATTTCTCCT TGGAGATCTT GCCATGGTTT CATCAGCTCT CTGACTCCCC 6300 

TGGAGTCTTC TAGGAGCTTC TCCTTACGGG AAGCGTCCTG TAGGACATTG 6350 
FIGURE 12F 

GCAGTTGTTT CTGCTTCCGT AATCCAGGAA AGAAACTTCT CCAGGTCCAG 6400 
AGGGAACTGC TGCAGTAATC TATGAGTTTC TTCCAAAGCA GCCTCTTGCT 6450 
CACTTACTCT TTTATGAATG TTTCCCCAAG AAGTATTGAT ATTCTCTGTT 6500 
ATCATGTGTA CTTTTCTGGT ATCATCAGGA GAATAGTCCC GAAGAAGTTT 6550 
CAGTGCCAAA TCATTTGCCA CGTCTACACT TATCTGCCGT TGACGGAGGT 6600 
CTTTGGCCAA CTGCTTGGTT TCTGTGATCT TCTTTTGGAT TGCATCTACT 6650 
GTGTGAGGAC CTTCTTTCCA TGAGTCAAGC TTGCCTCTGA CCTGTCCTAT 6700 
GACCTGTTCG GCTTCTTCCT TAGCTTCCAG CCATTGTGTT GAATCCTTTA 6750 
ACATTTCATT CAACTGTTGT CTCCTGTTCT GCAGCTGTTC TTGAACCTCA 6800 
TCCCACTGAA TCTGAATTCT TTCAATTCGA TCAGTAATGA TTGTTCTAGC 6850 
TTCTTGATTG CTGGTTTTGT TTTTCAAATT CTGGGCAGCA GTAATGAGTT 6900 
CTTCCAATTG GGGGCGTCTC TGTTCCAAAT CTTGCAGTGT TGCCTTCTGT 6950 
TTGATGATCA TTTCATTGAT GTCTTCCAGA TCACCCACCA TCACTCTCTG 7000 
TGATTTTATA ACTCGATCAA GCAGAGACAG CCAGTCTGTA AGTTCTGTCC 7050 
AAGCTCGGTT GAAGTCTGCC AGTGCAGGTA CCTCCAACAG CAAAGAAGAT 7100 
GGCATTTCTA GTTTGGAGAT GACAGTTTCC TTAGTAACCA CAGATTGTGT 7150 
CACTAGAGTA ACAGTCTGAC TGGCAGAGGC TCCAGTAGTG CTCAGTCCAG 7200 
GGGCACGGTC AGGCTGCTTT GTCCTCAGCT CCCGAAGTAA ATGGTTTACA 7250 
GCCTCCCACT CAGACCTCAG ATCTTCTAAC TTCCTCTTCA CTGCCTGAGT 7300 
GCTTGGTTTT TCCTTATACA AATGCTGCCC TTTCGACAAA AGCCTTTCCA 7350 
CATCCGCTTG TTTACCGTGA ACTGTTACTT CAATCTCCTT TATGTCAAAC 7400 
GGTCCTGCCT GACTTGGTTG GTTATAAATT TCCAACTGGT TTCTAATAGG 7450 
AGAGACCCAC AGAAGCAGGT GATCCAGCTG CTCTTCAAGC TGCCTAAAAT 7500 
CTTTTAAGTG AACCTGAAGC TCTCCTTGTT TCTCAGGTAA AGCTCTGGAG 7550 
ACCTTTATCC ACTGGAGATT TGTCTGTTTG AGCTTCTTTT CAAGTTTATC 7600 
FIGURE 12G 

TTGCTCTTCT GGCCTTATGG GAGCACTTAC AAGTACTGCT CCTCCTGTTT 7650 
CATTTAATTG TTTTAGAATT CCCTGGCGCA GGGGCAACTC TTCTGCCAGT 7700 
AACTTGACTT GTTCAAGTTG TTCTTTTAGC TGCTGCTCAT CTCCAAGTGG 7750 
AGTAATAGCA ATGTTATCTG CTTCTTCCAG CCACAAAACA AATTCATTTA 7800 
AATCTCTTTG AAATTCTGAC AAGACATTCT TTTGTTCTTC AATCCTCTTT 7850 
CTCCTTTCTG CCAGCTCTTT GCAGATGTCG TGCCACCGCA GACTCAAGCT 7900 
TCCTAATTTT TCTTGTAGAA TATTGACATC TGTTTTTGAA GACTGTTGAA 7950 

TTATTTCTTC CCCAGTTGCA TTCAGTGTTC TGACAACAGC TTGACGCTGC 8000 

CCAATGCCAT CCTGGAGTTC CTTAAGATAC CATTTGTATT TAGCATGTTC 8050 

CCAGTTTTCA GGATTTTGTG TCTTTTTGAA AAACTGTTCA ACTTCATTCA 8100 

GCCATTGATT AAATACCTTC ATATCATAAT GAAAGTGTCG CCATTTTTCA 8150 

ACTGATCTGT CGAATCGCCC TTGTCGTTCC TTGTACATTC TATGAAGTTT 8200 

TTCCCCCTGG AAATCCATCT GTGCCACGGC TTCCTGTACT TTCACCTTTT 8250 

CCATGGAGGT GGCACTTTGC AAGGCTGCTG TCTTCTTCTT GTGAATAATA 8300 

TCAATCCGAC CTGAGATTTG TTGCAAATTG TCTTTTATAT TCTTAAGAGA 8350 

CTCCTCTTGC TTAAAAAGAT CTTCAAAATC TTTAGCACAG AGTTCAGGAG 8400 

TATTTAGAAG ATGATCAACT TCTGAAAGAG CTTGTAAGAT ATGACTGATC 8450 

TCGGTCAAAT AAGTAGAAGG CACATAAGAA ACATCCAAAG GCATATCTTC 8500 

AGTCGTCACT ACCATAGTTT CTTCATGGAG AGTGTGAATT TGTGCAAAGT 8550 

TGAGTCTTCG AAACTGAGCA AAATTGCTCT CAATTTGCCG CCAGCGCTTG 8600 

CTGAGCTGGA TCTGAGTTGG CTCCACTGCC ATTGCGGCCC CATTCTCAGA 8650 

CAAGCCCTCA GCTTGCCTGC GCACTGCATT CAGCTCCTCT TTCTTCTTCT 8700 

GCAATTCACG ATCAATTTCC TTTAATTTTC TTTCATCTCT GGGTTCAGGT 8750 

AGGCTGGCTA ATTTTTTTTC AATTTCATCC AAGCATTTCA GGAGATCATC 8800 

AGCCTGCCTC TTGTACTGAT ACCACTGGTG AGAAATTTCT AGGGCCTTTT 8850 




PCT/US95/14017 

WO 96/13597 

FIGURE 12H 

TTCTTCTTTG AGACCTCAAA TCCTTGAGAG CATTATGTTT TGTCTGTAAC 8900 

AGCTGCTGTT TTATCTTTAT TTCCTCTCGC TTTCTCTCAT CTGTGATTCT 8950 

TTGTTGTAAG TTGTCTCCTC TTTGCAACAA TTCATTTACA GTACCCTCAT 9000 

TGTCTTCACT CATATCTTTA TTGAAGTCTT CCTCTTTCAG ATTCACCCCC 9050 

TGCTGAATTT CAGCCTCCAG TGGTTCAAGC AATTTTTGTA TATCTGAGTT 9100 

AAACTGCTCC AATTCCTTCA AAGGAATGGA GGCCTTTCCA GTCTTAATTC 9150 

TGTGAGAAAT AGCTGCAAAT CGACGGTTGA GCTCAGAGAT TTGGGGCTCT 9200 

ACTACTTTCC TGCAGTGGTC ACCGCGGTTT GCCATCAATT TTGCTGCTTG 9250 

GTCACGTGTG GAGTCCACCT TTGGGCGCAT GTCATTCATT TCAGCCTTTA 9300 

AACGCTTAAG AATGTCTTCC TTTTGTTGTG GTTTCTTCTT TTCAGACTCA 9350 

TCTAAAAGTT CATCTGCATG AATGATCCAC TTTGTGATTT GTTCTATGTT 9400 

CTGATCAAAG GTTTCCATGT GTTTCTGGTA TTCCAACAAA AGATTTAGCC 9450 

ATTCTTCTAC TCTGGAGGTG ACAGCTATCC AGTTACTGTT CAGAAGACTC 9500 

AGTTTATCTT CTACCAAGGT TTCTTTCTTG CCCAACACCA TTTTCAAAGA 9550 

CTCTCCTAAT TCTGTAACAC TCTTCAAGTG AGCCTTCTGT TTCTCAATCT 9600 

CTTTTTGAGT AGCCTTTCCC CAGGCAACTT CAGAATCCAA ATTACTTGGC 9650 

ATTCCTTCAA CTGCTGATCT CTTCGTCAAT TCTGTATCTG TTGCTGCCAG 9700 

CCATTCTGTT AAGACATTCA TTTCCTTTCT CATCTTACGG GACAACTTCA 9750 

AGGATTTCTC CAACTGTTGC TTTCTCTCTG TTACCTTCGC ACCCAACTCA 9800 

TTGTAATGCA ATTTCAAAGC TGTTACTCGT TCATCAAGCT CTTTGGGATT 9850 

TTCTGTCTGC TTTTTCTGTA CAATTTGACG TCCGGTTTTA ATCACCATTT 9900 

CCACTTCAGA CTTGACTTCA CTCAGGCTTT TATACAAGTT CACACAATGA 9950 

CTTAGTTGTG ACTGAATTAC TTCCTGTTCA ACACTCTTGG TTTCCAATGC 10000 

AGGCAAATGC ATCTTGACTT CATCTAAAAT CATCTTACTT TCCTCTAGAC 10050 

GTTGTTCAAA ATTGGCTGGT TTTTGGAATA ATCGAAATTT CATGGAGACA 10100 

TCTTGTAATT TTTTCTGTGC AACATCAATT TGTGAAAGAA CCCTTTGGTT 10150 
FIGURE 12I 

GGCATCCTTC CCCTGGTTAT GTTTCTTCAT TTCTTCTAAA CTTATCTCAT 10200 

GACTTGTCAA ATCTGATTGG ATTTTCTGGG CTTCCTGAGG CATTTGAGCT 10250 

GCATCCACCT TGTCAGTGAT ATAAGCTGCC AACTGCTTGT CAATGAATTC 10300 

AAGCGACTCC TGAATTAAGT GCAAGGACTT TTCAATTTCC TGGGCAGACT 10350 

GGATACTCTG TTCAAGCAAC TTTTGTTTCC TCACAGCCTC TTCATGTAGT 10400 

TCCCTCCAAC GAGAATTAAA CGTCTCAAGC TCCTCATTGA TCAGTTCATC 10450 

CATGACTCCT CCATCTGTAA GAGTCTGTGC CAATAGACGA ATCTGATTTG 10500 

GGTTCTCCTC TGAATGATGC ATCAGATTTT CAAGAGATTC TAGCACTTCA 10550 

GTGATTTCCT CAGGTCCTGC AGGAACATTT TCCATGGTTT TAAGTTTCAA 10600 

TTCTACTTCA TTGAGCCACT TGTTTGCTTT CTCTAAATAT GACAATAACT 10650 

CATGCCAACA TGCCCAAACT TCTTCCAAAG TTTTGCATTT TCCATTCAGC 10700 

CTGGTGCACA GCCATTGGTA GTTGGTGGTC AGAGTTTCAA GTTCCTTTTT 10750 

TAAGGCCTCT TGTGCTGAGG GTGGAGCGTG AGCTATTACA CTATTTACAG 10800 

TCTCAGTAAG GAGTTTCACT TTAGTTTCTT TTTGTAGTGC CTCTTCTTTA 10850 

GCTCTCTTCA TTTCTTCAAC AGCAGTCTGT AATTCATCTG GAGTTTTATA 10900 

TTCAAAATCT CTCTCTAGAT ATTCTTCTTC AGCTTGTGTC ATCCACTCAT 10950 

GCATCTCTGA TAGATCTTTT TGGAGGCTTA CGGTTTTATC CAAACCTGCC 11000 

TTTAAGGCTT CCTTTCTGGT GTAGACCTGG CGGCATATGT GATCCCACTG 11050 

AGTGTTAAGC TCTCTAAGTT CTGTCTCCAG TCTGGATGCA AACTCAAGTT 11100 

CAGCTTCACT CTTTATCTTC TGCCCACCTT CATTAACACT ATTTAAACTG 11150 

GGCTGAATTG TTTGAATATC ACCAACTAAA AGTCTGGATT GTTTGAGCTG 11200 

TTTTTTCAGG ATTTCAGCAT CCCCCAGGGC AGGCCATTCC TCTTTCAGGA 11250 

AAACATCAAC TTCAGCCATC CATTTCTGTA AGGTTTTTAT GTGATTCTGA 11300 

AATTTTCGAA GTTTATTCAT ATGTTCTTCT AGCTTTTGGC AGCTTTCCAC 11350 

CAACTGGGAG GAAAGTTTCT TCCAGTGCCC CTCAATCTCT TCAAATTCTG 11400 
FIGURE 12J 

ACAGATATTT CTGGCATATT TCTGAAGGTG CTTTCTTGGC CATCTCCTTC 11450 
ACAGTGTCAC TCAGATAGTT GAAGCCATTT TGTTGCTCTT TCAAAGAACT 11500 
TTGCAGAGCC TGTAATTTCC CGAGTCTCTC CTCCATTATT TCATATTCAG 11550 
TAACACTAAG ATAAGGTACA GAGAGTTTGC TTTCTGACTG CTGGATCCAC 11600 
GTCCTGATGC TACTCATTGT CTCCTGATAG CGCATTGGTG GTAAAGTGTC 11650 
AAAAATTGTC TGTAGCTCTT TCTCTTTGGC CCTCACACCA TCAAAGATGT 11700 
GGTTAAAATG ATTAGTAAAG GCCACAAAGT CTGCATCCAG AAACATTGGC 11750 
CCCTGTCCCT TTTCTTTCAG TTGTAGACTC TGAATTTTTA ATTGCTCAAT 11800 
TTGAGGCTGA AGAGCTGACA ATCTGTTGAC TTCATCCTTA CAAATTTTTA 11850 
ACTGGCTTTT AATTGCTGTT GGCTCTGATA GGGTGGTAGA CTGGGTTTTC 11900 
AACAAGTTTP CGGCAGTAGT TGTCATCTGT TCCAATTGTT GTAGCTGATT 11950 
ATAAAAGGTA ATGATGTTGG TTTGATACTC TAGCCAGTTA ACTCTCTCAC 12000 
TCAGCAATTG GCAGAATTCT GTCCACCGGC TGTTCAGTTG TTCTGAAGCT 12050 
TGTCTGATAC TTTCAGCATT AACACCCTCA TTTGCCATCT GTTCCACCAG 12100 
GGCCTGAGCT GATCTGCTGG CATCTTGCAG TTTTCTGAAC TTCTCTGCTT 12150 
TTTCTCGTGC TATGGCATTG ACTTTTTCTT GCAAGTCTGA GATGTTGCCT 12200 
TCTTTTCGAT AGACTGCAAA TTCAGAACTC TGTAATACAG CTTCTGAACG 12250 
AGTAATCCAA CTGTGAAGTT CAGTTATATC GACATCCAAC CTTTTCCTGA 12300 
GTTCAGAATC CACAGTTATC TGCCTCTTCT TTTGAGGAGG TGGTGGTGGA 12350 
AGTTCCTCTT GGGCATGTTT TACCATGATT TGTTCCCTTG TGGTCACCAT 12400 
AGTTACCGTT TCCATTACAG TTGTCTGTGT TAGGGATGGT TGAGTGGTGG 12450 
TGACAGCCTG TGAAATTTGT GCTGAACTCT TTTCAAGTTT TTGGGTTAAA 12500 
TTGTCCCAAC GTTGTGCAAA GTTTTCCATC CAGATTTCCA TCTTTTGAGT 12550 
CACTGACTTA TTTTTCAGTG CCGAAAGTAG ATCTTGATTG AGTGAACTTA 12600 
GTTTTTCCAT GGTTGGCTTT TTCTTTTCTA GATCTATTTT TAAAGTAGAT 12650 
FIGURE 12K 

ATTTTGTGAA GACTTGACAT CATTTCATTT TGATCTTTAA AGCCACTTGT 12700 
CTGAATGTTC TTCATTGCAT CTTCTTTTTC TGAAAGCCAT GTACTAAAAA 12750 
GGCACTGTTC TTCAGTAAAA TGCTGCCATT TTAGAAGAAT ATCTTGTAAA 12800 
ACAATCCAGC GGTCTTCAGT CCATCTGCAG ATATTTGCCC ATCGATCTCC 12850 
CAGTACCTTA AGTTGTTCTT CCAAAGCAGC TGTTGCATGA TCACCGCTGG 12900 
ATTCATCAAC CACTACTACC ATGTGAGTGA GCGAGTTGAC CCTGACCTGC 12950 
TCCTGTTCTA GATCTTCTTG AAGCACCTTA TGTTGTTGTA CTTGGCATTT 13000 
TAGATCTTCA AGATCAGGTC CAAAGGGCTC TTCCTCCATT TTCTTAGTTC 13050 
TCTCTTCAGT TTTTGTTAAC CAGTCATCTA GTTCTTTTAA TTTCTGATTC 13100 
TGGAGATCCA TTAGAACTTT GTGTAATTTG CTTTGTTTTT CCATGCTAGC 13150 
TACCCTGAGA CATTCCCATC TTGAATTTAG GAGATTCATT TGTTCTTGCA 13200 
CTTCAGCTTC TTCATCTTCT GATAATTTCC CTTTTCCAAC TAGTTGACTT 13250 
CCTAACTGTA GAACATTACC AACAAGTCCT TGATGAGATG TCAGATCCAT 13300 
CATGAATCCC TCATGAGCAT GAAACTGTTC TTTCACTTCT TCAACATCAT 13350 
TTGAAATCTC TCCTTGTGCT CGCAATGTAT CCTCGGCAGA AAGAAGCCAT 13400 
GAAAGTACTT CTTCTAAAGC AGTTTGGTAA CTATCCAGAT TTACTTCCGT 13450 
CTCCATCAAT GAACTGTCAA GTGACTTGTC TCTGGGAGCT TCCAAATGCT 13500 
GTGAAGGATA GGGGCTCTGT GTGGAATCAG AGGTGGCAAC ATAAGCAGCC 13550 
TGTGTGAAGG CATAACTCTT GAATCGAGGC TTAGGAGATG AAGAAGTTTG 13600 
TTCATAGCCC TGTGCTAGAC TGACTGTGAT CTGTTGAGAG TAATGCATCT 13650 
GGTGATGTAA TTGAAAATGT TCTTCTCTAG TTACTTTTGA AGATGTCCTG 13700 
GGCAACATTT CCACTTCTTG AATGGCTTCA ATGCTCACTT GTTGTGGCAA 13750 
AACTTGAAAG AGTGATGTGA TGTACATTAA GATGGACTTC TTGTCTGGAT 13800 
AAGTGGTAGC AACATCTTGA GGATCAAGAA GTTTTTCTAT GCCTAACTGG 13850 
CATTTTGCAA TGTTGAAGGC ATGTTCCAGT CTTTGGGTGG CTGAGTGCTG 13900 
FIGURE 12L 

TGAAACCACA CTATTCCAAT CAAACAGGTC GGCCCTGTGA CTATGGATAA 13950 
GAGCATTCAA AGCCAACCCG TCGGACCAGC TAGAGGTGAA GTTGATGACG 14000 
TTAACCTGTG GATAATTACG TGTTGACTGT CGAACCCAGC TCAGAAGAAT 14050 
CTTTTCACTG TTGGTTTGCT GCAATCCAGC C TGATAGTT TTCATCACAT 14100 
TTTTGACCTG CCAGTGGAGG ATTATATTCC AAATCAAACC AAGAGTGAGT 14150 
TTATGATTTC CATCCACTAT GTCAGTGCTT CCTATATTCA CTAAATCAAC 14200 
ATTATTTTTC TGTAAGACCC GCAGTGCCTT GTTGACATTG TTCAGGGCAT 14250 
GAACTCTTGT AGATCCCTTT TCTTTTGGCA GTTTTTGCCC TGTAAGGCCT 14300 
TCCAAGAGGT CTAGGAGGCG TTTTCCATCC TGCAGGTCAC TGAAGAGGTT 14350 
GTCTATGTGT TGCTTTCCAA ACTTAGAAAA TTGTGCATTT ATCCATTTTG 14400 
TGAATGTTTT CTTTTGAACA TCTTCTCTTT CATAACAGTC CTCTACTTCT 14450 
TCCCACCAAA GCATTTGGAA GAAAAAGTAT ATATCAAGGC AGGGATAAAA 14500 
ATCTTGGTAA AAGTTTCTCC CAGTTTTATT GCTCCAGGAG GCTTAGGTAC 14550 
GATGAGAAGC CAATAAACTT CAGCAGCCTT GACAAAAAAA AAAAAAAAAA 14600 
TAGCACTTCA AGTCTTCCTA TTCGTTTTTT CTATAAAGCT ATTGCCTTCA 14650 
AGAGCGGAAT TCCTGCAGCC CGGGGGATCC ACTAGTTCTA GAGCGGCCGC 14700 
GGGTACAATT CCGCAGCTTT TAGAGCAGAA GTAACACTTC CGTACAGGCC 14750 
TAGAAGTAAA GGCAACATCC ACTGAGGAGC AGTTCTTTGA TTTGCACCAC 14800 
CACCGGATCC GGGACCTGAA ATAAAAGACA AAAAGACTAA ACTTACCAGT 14850 
TAACTTTCTG GTTTTTCAGT TCCTCGAGTA CCGGATCCTC TAGAGTCCGG 14900 
AGGCTGGATC GGTCCCGGTG TCTTCTATGG AGGTCAAAAC AGCGTGGATG 14950 
GCGTCTCCAG GCGATCTGAC GGTTCACTAA ACGAGCTCTC CTTATATAGA 15000 
CCTCCCACCG TACACGCCTA CCGCCCATTT GCGTCAATGG GGCGGAGTTG 15050 
TTACGACATT TTGGAAAGTC CCGTTGATTT TGGTGCCAAA ACAAACTCCC 15100 
ATTGACGTCA ATGGGGTGGA GACTTGGAAA TCCCCGTGAG TCAAACCGCT 15150 
ATCCACGCCC ATTGATGTAC TGCCAAAACC GCATCACCAT GGTAATAGCG 15200 
FIGURE 12M 

ATGACTAATA CGTAGATGTA CTGCCAAGTA GGAAAGTCCC ATAAGGTCAT 15250 

GTACTGGGCA TAATGCCAGG CGGGCCATTT ACCGTCATTG ACGTCAATAG 15300 

GGGGCGTACT TGGCATATGA TACACTTGAT GTACTGCCAA GTGGGCAGTT 15350 

TACCGTAAAT ACTCCACCCA TTGACGTCAA TGGAAAGTCC CTATTGGCGT 15400 

TACTATGGGA ACATACGTCA TTATTGACGT CAATGGGCGG GGGTCGTTGG 15450 

GCGGTCAGCC AGGCGGGCCA TTTACCGTAA GTTATGTAAC GACCTGCAGG 15500 

TCGACTCTAG AGGATCTCCC TAGACAAATA TTACGCGCTA TGAGTAACAC 15550 

AAAATTATTC AGATTTCACT TCCTCTTATT CAGTTTTCCC GCGAAAATGG 15600 

CCAAATCTTA CTCGGTTACG CCCAAATTTA CTACAACATC CGCCTAAAAC 15650 

CGCGCGAAAA TTGTCACTTC CTGTGTACAC CGGCGCACAC CAAAAACGTC 15700 

ACTTTTGCCA CATCCGTCGC TTACATGTGT TCCGCCACAC TTGCAACATC 15750 

ACACTTCCGC CACACTACTA CGTCACCCGC CCCGTTCCCA CGCCCCGCGC 15800 

CACGTCACAA ACTCCACCCC CTCATTATCA TATTGGCTTC AATCCAAAAT 15850 

AAGGTATATT ATTGATGATG CTAGCGGGGC CCTATATATG GATCCAATTG 15900 

CAATGATCAT CATGACAGAT CTGCGCGCGA TCGATATCAG CGCTTTAAAT 15950 

TTGCGCATGC TAGCTATAGT TCTAGAGGTA CCGGTTGTTA ACGTTAGCCG 16000 

GCTACGTATA CTCCGGAATA TTAATAGGCC TAGGATGCAT ATGGCGGCCG 16050 

GCCGCCTGCA GCTGGCGCCA TCGATACGCG TACGTCGCGA CCGCGGACAT 16100 

GTACAGAGCT CGAGAAGTAC TAGTGGCCAC GTGGGCCGTG CACCTTAAGC 16150 

TTGGCACTGG CCGTCGTTTT ACAACGTCGT GACTCGGAAA ACCCTGGCGT 16200 

TACCCAACTT AATCGCCTTG CAGCACATCC CCCTTTCGCC AGCTGGCGTA 16250 

ATAGCGAAGA GGCCCGCACC GATCGCCCTT CCCAACAGTT GCGCAGCCTG 16300 

AATGGCGAAT GGCGCCTGAT GCGGTATTTT CTCCTTACGC ATCTGTGCGG 16350 

TATTTCACAC CGCATACGTC AAAGCAACCA TAGTACGCGC CCTGTAGCGG 16400 

CGCATTAAGC GCGGCGGGTG TGGTGGTTAC GCGCAGCGTG ACCGCTACAC 16450 
FIGURE 12N 

TTGCCAGCGC CCTAGCGCCC GCTCCTTTCG CTTTCTTCCC TTCCTTTCTC 16500 

GCCACGTTCG CCGGCTTTCC CCGTCAAGCT CTAAATCGGG GGCTCCCTTT 16550 

AGGGTTCCGA TTTAGTGCTT TACGGCACCT CGACCCCAAA AAACTTGATT 16600 

TGGGTGATGG TTCACGTAGT GGGCCATCGC CCTGATAGAC GGTTTTTCGC 16650 

CCTTTGACGT TGGAGTCCAC GTTCTTTAAT AGTGGACTCT TGTTCCAAAC 16700 

TGGAACAACA CTCAACCCTA TCTCGGGCTA TTCTTTTGAT TTATAAGGGA 16750 

TTTTGCCGAT TTCGGCCTAT TGGTTAAAAA ATGAGCTGAT TTAACAAAAA 16800 

TTTAACGCGA ATTTTAACAA AATATTAACG TTTACAATTT TATGGTGCAC 16850 

TCTCAGTACA ATCTGCTCTG ATGCCGCATA GTTAAGCCAG CCCCGACACC 16900 

CGCCAACACC CGCTGACGCG CCCTGACGGG CTTGTCTGCT CCCGGCATCC 16950 

GCTTACAGAC AAGCTGTGAC CGTCTCCGGG AGCTGCATGT GTCAGAGGTT 17000 

TTCACCGTCA TCACCGAAAC GCGCGAGACG AAAGGGCCTC GTGATACGCC 17050 

TATTTTTATA GGTTAATGTC ATGATAATAA TGGTTTCTTA GACGTGAGGT 17100 

GGCACTTTTC GGGGAAATGT GCGCGGAACC CCTATTTGTT TATTTTTCTA 17150 

AATACATTCA AATATGTATC CGCTCATGAG ACAATAACCC TGATAAATGC 17200 

TTCAATAATA TTGAAAAAGG AAGAGTATGA GTATTCAACA TTTCCGTGTC 17250 

GCCCTTATTC CCTTTTTTGC GGCATTTTGC CTTCCTGTTT TTGCTCACCC 17300 

AGAAACGCTG GTGAAAGTAA AAGATGCTGA AGATCAGTTG GGTGCACGAG 17350 

TGGGTTACAT CGAACTGGAT CTCAACAGCG GTAAGATCCT TGAGAGTTTT 17400 

CGCCCCGAAG AACGTTTTCC AATGATGAGC ACTTTTAAAG TTCTGCTATG 17450 

TGGCGCGGTA TTATCCCGTA TTGACGCCGG            CTCGGTCGCC 17500 

GCATACACTA TTCTCAGAAT GACTTGGTTG AGTACTCACC AGTCACAGAA 17550 

AAGCATCTTA CGGATGGCAT GACAGTAAGA GAATTATGCA GTGCTGCCAT 17600 

AACCATGAGT GATAACACTG CGGCCAACTT ACTTCTGACA ACGATCGGAG 17650 

GACCGAAGGA GCTAACCGCT TTTTTGCACA ACATGGGGGA TCATGTAACT 17700 
FIGURE 12O 

CGCCTTGATC GTTGGGAACC GGAGCTGAAT GAAGCCATAC CAAACGACGA 
GCGTGACACC ACGATGCCTG TAGCAATGGC AACAACGTTG CGCAAACTAT 
TAACTGGCGA ACTACTTACT CTAGCTTCCC GGCAACAATT AATAGACTGG 
ATGGAGGCGG ATAAAGTTGC AGGACCACTT CTGCGCTCGG CCCTTCCGGC 
TGGCTGGTTT ATTGCTGATA AATCTGGAGC CGGTGAGCGT GGGTCTCGCG 
GTATCATTGC AGCACTGGGG CCAGATGGTA AGCCCTCCCG TATCGTAGTT 
ATCTACACGA CGGGGAGTCA GGCAACTATG GATGAACGAA ATAGACAGAT 
CGCTGAGATA GGTGCCTCAC TGATTAAGCA TTGGTAACTG TCAGACCAAG 
TTTACTCATA TATACTTTAG ATTGATTTAA AACTTCATTT TTAATTTAAA 
AGGATCTAGG TGAAGATCCT TTTTGATAAT CTCATGACCA AAATCCCTTA 
ACGTGAGTTT TCGTTCCACT GAGCGTCAGA CCCCGTAGAA AAGATCAAAG 
GATCTTCTTG AGATCCTTTT TTTCTGCGCG TAATCTGCTG CTTGCAAACA 
AAAAAACCAC CGCTACCAGC GGTGGTTTGT TTGCCGGATC AAGAGCTACC 
AACTCTTTTT CCGAAGGTAA CTGGCTTCAG CAGAGCGCAG ATACCAAATA 
CTGTTCTTCT AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA 
GCACCGCCTA CATACCTCGC TCTGCTAATC CTGTTACCAG TGGCTGCTGC 
CAGTGGCGAT AAGTCGTGTC TTACCGGGTT GGACTCAAGA CGATAGTTAC 
CGGATAAGGC GCAGCGGTCG GGCTGAACGG GGGGTTCGTG CACACAGCCC 
AGCTTGGAGC GAACGACCTA CACCGAACTG AGATACCTAC AGCGTGAGCT 
ATGAGAAAGC GCCACGCTTC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG 
TAAGCGGCAG GGTCGGAACA GGAGAGCGCA CGAGGGAGCT TCCAGGGGGA 
AACGCCTGGT ATCTTTATAG TCCTGTCGGG TTTCGCCACC TCTGACTTGA 
CCGTCGATTT TTGTGATCCT CGTCAGGGCG GCGGAGCCTA TGGAAAAACG 
CCAGCAACGC GGCCTTTTTA CGGTTCCTGG CCTTTTGCTG GCCTTTTGCT 
CACATGTTCT TTCCTGCGTT ATCCCCTGAT TCTGTGGATA ACCGTATTAC 
FIGURE 12P 

CGCCTTTGAG TGAGCTGATA CCGCTCGCCG CAGCCGAACG ACCGAGCGCA 19000 

GCGAGTCAGT GAGCGAGGAA GCGGAAGAGC GCCCAATACG CAAACCGCCT 19050 

CTCCCCGCGC GTTGGCCGAT TCATTAATGC AGCTGGCACG ACAGGTTTCC 19100 

CGACTGGAAA GCGGGCAGTG AGCGCAACGC AATTAATGTG AGTTAGCTCA 19150 

CTCATTAGGC ACCCCAGGCT TTACACTTTA TGCTTCCGGC TCGTATGTTG 19200 

TGTGGAATTG TGAGCGGATA ACAATTTCAC ACAGGAAACA GCTATGACCA 19250 

TGATTACGAA TTCGAATGGC CATGGGACGT CGACCTGAGG TAATTATAAC 19300 

CCGGGCC 19307 